
Songs to playlist classification using NLP | by Gabriele Albini | Nov, 2022



A guided approach to assign new songs to Spotify playlists, using word2vec and logistic regression

Photo by israel palacio on Unsplash

This article presents an NLP project aimed at assigning songs to playlists.

Two playlists were selected from Spotify and, via the Spotify API, information such as artist, song title, and popularity was downloaded. The song lyrics, not available through the API, were obtained via web scraping.

Next, some data pre-processing steps were performed on the raw lyrics in order to train a Word2Vec model and encode the text into high-dimensional vectors.

A 2D representation of each playlist was generated using PCA. Finally, we approached the task of assigning new songs to playlists, solving it with a Logistic Regression model and presenting the result graphically.

Here’s an overview of the approach used:

Image by author

The article is based on the song lyrics found in these two Spotify playlists:

  • The first is the Global playlist, which includes the 50 most-listened-to songs across the platform. The playlist is updated daily and generally includes trending pop songs.
  • The second playlist is a metal mix, including the 50 most-streamed metal songs.

The music genres of the two playlists seem quite “far” apart: let’s verify whether this “distance” also appears in the vectors we will obtain with NLP models, and whether it helps us in a playlist assignment task.

1.1 Downloading playlist information with the Spotify API

In order to use the Spotify API, we first need to create a developer account at https://developer.spotify.com/ and register a new app.

Next, by clicking on our app, we can obtain the credentials needed to authenticate (Client ID and Client Secret):

Image by author

With the above information, we can connect to the API:
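Here is a minimal sketch of the connection, assuming the popular spotipy client library (the article’s original code is not shown, so this is an illustration, not the author’s exact implementation):

```python
# A minimal sketch using spotipy (an assumption).
# The credentials come from the app dashboard shown above.
import spotipy
from spotipy.oauth2 import SpotifyClientCredentials

auth_manager = SpotifyClientCredentials(
    client_id="YOUR_CLIENT_ID",          # from the app dashboard
    client_secret="YOUR_CLIENT_SECRET",  # from the app dashboard
)
sp = spotipy.Spotify(auth_manager=auth_manager)
```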

Finally, by passing the playlist IDs (which we can get from the playlist URLs on Spotify), we can develop a function to get the information we want, specifically: Artist, Title, Album, Popularity.

(Genre is a field that was manually created to differentiate between the two playlists).
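A minimal sketch of such a function, again assuming spotipy (the playlist ID below is a placeholder):

```python
def get_playlist_tracks(sp, playlist_id, genre):
    """Return Artist / Title / Album / Popularity for each playlist track."""
    rows = []
    for item in sp.playlist_items(playlist_id)["items"]:
        track = item["track"]
        rows.append({
            "Artist": track["artists"][0]["name"],
            "Title": track["name"],
            "Album": track["album"]["name"],
            "Popularity": track["popularity"],
            "Genre": genre,  # manual label to differentiate the two playlists
        })
    return rows

global_songs = get_playlist_tracks(sp, "GLOBAL_PLAYLIST_ID", "global")
```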

Image by author

1.2 Scraping song lyrics

At the time of writing, the Spotify API doesn’t allow extracting song lyrics. However, with the artist and title, we can perform some web scraping in order to obtain them.

The code developed for this purpose is quite long and beyond the scope of this article. I created a tutorial on web scraping, which I reference here.

With this step done, the data finally looks like:

Image by author

The data used in the article has been uploaded to this GitHub repository.

1.3 Data Pre-processing

Our goal is to embed words into vectors using the Word2Vec model.

In order to do so, we will proceed with the following steps, so that the raw text is cleaned and transformed into a format the model can use (a code sketch follows the list):

  • Remove useless lines (e.g. “Line repeat” or “Chorus” markers that can be found in lyrics), numbers, and symbols
  • Lowercase each word
  • Lemmatize words: we preferred lemmatization to stemming. The two techniques gave very similar results in the project, but while stemming truncates words, lemmatization transforms them into a common “base” form, giving more readable results. For example, the word “caring” would be truncated to “car” with stemming, but transformed into “care” with lemmatization
  • (Stopwords, which are normally eliminated in NLP pre-processing, were kept here. Removing them didn’t improve results on this project, probably because the vocabulary is not that large and the Word2Vec model can make use of them when evaluating context)
  • (Finally, word tokenization will be performed at model training)
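Here is a minimal sketch of these cleaning steps, assuming NLTK’s WordNet lemmatizer and a `raw_lyrics` string per song (the line-filter pattern is a hypothetical example):

```python
import re
from nltk.stem import WordNetLemmatizer  # requires nltk.download("wordnet")

lemmatizer = WordNetLemmatizer()

def clean_song(raw_lyrics):
    cleaned_lines = []
    for line in raw_lyrics.splitlines():
        # Skip structural markers such as "Chorus" or "Line repeat"
        if re.fullmatch(r"\s*(\[.*\]|chorus.*|line repeat.*)\s*", line, re.IGNORECASE):
            continue
        # Remove numbers and symbols, then lowercase
        line = re.sub(r"[^A-Za-z\s]", " ", line).lower()
        # Lemmatize (POS tags omitted for brevity; verbs may need pos="v")
        words = [lemmatizer.lemmatize(w) for w in line.split()]
        if words:
            cleaned_lines.append(" ".join(words))
    return cleaned_lines  # one element per song line
```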

After these steps, each song is converted to a list in which each element is a song line:

Image by author

1.4 Playlists overview with word clouds

We will now have a very first look at the data by generating two word cloud charts, one per playlist.

To produce the images below, the lyrics of all songs were merged into two variables, one per playlist, and the word clouds were then generated using masks.
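A minimal sketch with the wordcloud package (`metal_text` and the mask file are assumptions):

```python
import numpy as np
from PIL import Image
from wordcloud import WordCloud

mask = np.array(Image.open("metal_mask.png"))  # hypothetical mask image
wc = WordCloud(background_color="white", mask=mask).generate(metal_text)
wc.to_file("metal_wordcloud.png")
```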

These charts are based on word occurrences, highlighting the words that occur the most in each playlist:

Metal playlist | Image by author
Global playlist | Image by author

In order to train a model able to assign new songs to the playlists, we will need to embed lyrics into vectors.

There are several strategies to do that, which could be grouped into two categories:

  1. Models based on term frequencies, known as “bag of words” approaches, such as the Tf-Idf model or N-grams
  2. Models that use simple neural networks to extract a vector representation that takes the “distance” (or similarity) between words into account. These models are more advanced, and their starting point is that two words appearing in the same context should be “close” in meaning. With these models, it is therefore possible to capture similarities between words based on the contexts they appear in. They can be further categorized into two approaches:
  • Global matrix factorization methods (e.g. LSA, LDA)
  • Local context window methods (e.g. Word2Vec using CBOW / skip-gram). This will be our focus area.

(Note: There are many other approaches that can be used, such as GloVe which aims at extracting word meaning from the distribution of word occurrences using the full corpus — [1] Pennington et al., 2014).

Word2Vec will be used in this project, as the Python Gensim library provides built-in methods that exactly serve our purposes.

2.1 Word2Vec model overview

Mikolov et al. (2013) [2] developed this model, which consists of a one-hidden-layer neural network trained on a word classification task. The network learns the syntactic and semantic relationships of a word with its context (using both preceding and following words in a given window).

There are two possible algorithms that can be used within the model: Skip-gram and continuous bag-of-words (CBOW).

In the skip-gram variant, the objective is to predict a word’s context given the word; CBOW is the mirror image: its objective is to predict the word given its context.

The skip-gram model is the one that performed better in our project, and its architecture is the following:

Image by Mikolov et al. (2013) [2]

In order to obtain a vector representation of each word that successfully predicts its surrounding words, the algorithm maximizes the average log probability of observing the context words given the current word:

Image by Mikolov et al. (2013) [3]
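For reference, this objective from [3] can be written as:

$$\frac{1}{T}\sum_{t=1}^{T}\;\sum_{\substack{-c \le j \le c \\ j \neq 0}} \log p\left(w_{t+j} \mid w_t\right)$$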

In the formula:

  • c is the “window”, or the size of the context of a word. It is a model hyperparameter expressed in “words”, e.g. 7 means that we’re using a context of 7 words around our current word w_t
  • T indicates the training size (i.e. all the words in the corpus used to train the model)
  • The conditional probability is defined with the following softmax function, where v and v′ represent the input and output vector representations of the word w in the neural network:
Image by Mikolov et al. (2013) [3]
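In LaTeX form, the softmax from [3] reads (W being the number of words in the vocabulary):

$$p(w_O \mid w_I) = \frac{\exp\left({v'_{w_O}}^{\top} v_{w_I}\right)}{\sum_{w=1}^{W} \exp\left({v'_{w}}^{\top} v_{w_I}\right)}$$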

The final output will be vectors (one per word in the vocabulary) of a desired dimension (another model hyperparameter).

These vectors should capture the relationships among words with high accuracy. For instance, Mikolov et al., 2013 [3] obtained vectors that could be used to perform linear operations: <King> – <Man> + <Woman> gives a result very close to the vector corresponding to <Queen>.

2.2 Probability approximation

The above maximisation would require the computation of the gradient of the log-probability.

The paper by Mikolov et al., 2013 [3] shows how the computation of this gradient has a complexity of O(V), i.e. proportional to the size of the vocabulary (normally huge). This represents an implementation bottleneck.

The same paper, however, presents several strategies to implement word2vec more efficiently, such as:

  • An approximation of the softmax via the hierarchical softmax. This simplifies the output layer by using a binary tree representation, reducing the number of output nodes that need to be evaluated.
  • Negative sampling: this technique trains the model to distinguish the target word from words drawn from a noise distribution, so that the full softmax never has to be computed.

Both techniques were used on the project, and the hierarchical softmax approximation gave better results.

3.1 Word2Vec Model Training

To train the word2vec model, we first tokenize the lyrics and choose the hyperparameters that gave the best performance on the classification task we’ll present below (the Gensim training call is sketched after the list):

  • sg = 1 to use the Skip-gram algorithm
  • hs = 1 to use the hierarchical softmax approach to approximate probabilities
  • vector_size = 300 means that the output vectors, for each word, will have 300 coordinates. This is the same vector dimension used by word2vec models trained on the entire Google News corpus
  • window = 7 means that our context will be based on 7 words. Different window sizes were tested, and values between 5 and 10 gave the best results in this project.
  • min_count = 3 means that words appearing fewer than 3 times will be excluded from the corpus. This filters out words that are too rare to be of any use and decreases the size of our vocabulary.
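With Gensim, the training call is a one-liner (the `tokenized_lyrics` variable is an assumption: a list of token lists, one per lyric line, as produced in section 1.3):

```python
from gensim.models import Word2Vec

w2v = Word2Vec(
    sentences=tokenized_lyrics,  # iterable of token lists
    sg=1,             # Skip-gram algorithm
    hs=1,             # hierarchical softmax
    vector_size=300,  # output vector dimension
    window=7,         # context window size
    min_count=3,      # ignore words appearing fewer than 3 times
)
print(len(w2v.wv))    # vocabulary size (1223 on this dataset)
```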

With these values, the model trained very fast and the vocabulary size on this small dataset is 1223.

3.2 Plotting playlist centroids

After training, we obtain one vector per word in our playlists. We can therefore compute the playlist centroids and the most representative words of each playlist:

  • First, we extracted the vector corresponding to each word in a playlist and averaged them to obtain the centroids
  • Then, we extracted the words most similar to each centroid: these are the most representative words of each playlist (see the sketch below).
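A minimal sketch of both steps, assuming `metal_words` holds the tokens of the metal playlist:

```python
import numpy as np

def playlist_centroid(words, w2v):
    # Average the vectors of all in-vocabulary words of a playlist
    vectors = [w2v.wv[w] for w in set(words) if w in w2v.wv]
    return np.mean(vectors, axis=0)

metal_centroid = playlist_centroid(metal_words, w2v)

# Words closest to the centroid (cosine similarity): the "top" words
print(w2v.wv.similar_by_vector(metal_centroid, topn=10))
```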

The “top” 10 words per metal and global playlists, respectively, are:

Image by author

It is interesting to notice that the most representative words do not necessarily correspond to the most frequent words (shown in the word cloud images above). As expected, the algorithm goes beyond word frequency to calculate the vector coordinates.

We can also visualize these words; we’ll use PCA to project our 300-coordinate vectors into 2D:
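A minimal sketch with scikit-learn and matplotlib (`top_metal` and `top_global` are assumptions: the lists of top-10 words extracted above):

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.decomposition import PCA

words = top_metal + top_global               # the 10 + 10 top words
X = np.array([w2v.wv[w] for w in words])     # 20 x 300 matrix

coords = PCA(n_components=2).fit_transform(X)

plt.scatter(coords[:, 0], coords[:, 1])
for (x, y), word in zip(coords, words):
    plt.annotate(word, (x, y))
plt.show()
```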

Image by author

Similarly, we can plot the PCA of the two playlist centroids:

Image by author

We can separate the two playlists in terms of their most representative words and the two centroids. Let’s exploit these distances to assign new songs to playlists.

3.3 Song Classification with Logistic Regression

We will now use our vectors and playlist labels to train a logistic regression model. The model will be tested on a test set composed of 6 unseen songs. (These songs were previously part of these two playlists on Spotify but are no longer included in them.) We will check whether the model assigns the songs to the right playlist:

Image by author

Logistic regression is a generalised linear model and a very common classification technique, especially used for binary classification (2 classes), although there are adaptations of the model for multi-class problems.

The model returns the probability that a record belongs to “class 1”; a threshold can be set in order to “hard”-assign records to class 1 only if the probability is above that threshold.
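Concretely, for a song vector x, the model estimates:

$$p(y = 1 \mid \mathbf{x}) = \sigma\left(\mathbf{w}^{\top}\mathbf{x} + b\right) = \frac{1}{1 + e^{-(\mathbf{w}^{\top}\mathbf{x} + b)}}$$

and the record is assigned to class 1 when this probability exceeds the chosen threshold (commonly 0.5).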

Data pre-processing:

Let’s first prepare the data sets:

  • We will use the lyrics we’ve already imported to compute an average vector per song. This will be our train set.
  • We will load the new songs and compute one vector per song (averaging its word vectors), using the same word2vec model. This will be our test set (see the sketch after this list).
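A minimal sketch of both steps (`train_lyrics`, `test_lyrics`, and `train_labels` are assumptions: pre-processed tokens per song and the playlist label of each training song):

```python
import numpy as np

def song_vector(tokens, w2v):
    # One vector per song: the average of its in-vocabulary word vectors
    vectors = [w2v.wv[t] for t in tokens if t in w2v.wv]
    return np.mean(vectors, axis=0)

X_train = np.array([song_vector(t, w2v) for t in train_lyrics])
X_test = np.array([song_vector(t, w2v) for t in test_lyrics])
```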

Hyperparameter tuning:

Next, we run a grid search with cross-validation to identify the best model hyperparameters:
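A minimal sketch of the tuning step; the parameter grid below is an assumption, not the article’s exact grid:

```python
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

param_grid = {
    "C": [0.01, 0.1, 1, 10],          # inverse regularisation strength
    "solver": ["lbfgs", "liblinear"],
}

search = GridSearchCV(LogisticRegression(max_iter=1000), param_grid, cv=5)
search.fit(X_train, train_labels)
print(search.best_params_)
```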

Model training & testing:

We can now train the optimal model on the train set and then test it by classifying the new songs:
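A minimal sketch of this final step (by default, GridSearchCV refits the best model on the full train set):

```python
best_model = search.best_estimator_          # already refit on X_train

train_accuracy = best_model.score(X_train, train_labels)
predictions = best_model.predict(X_test)     # playlist label per new song
probabilities = best_model.predict_proba(X_test)

print(train_accuracy, predictions, probabilities)
```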

We obtain the predicted labels and performances below: very high accuracy on the train set and one misclassified song on the test set.

Image by author

Graphical overview:

Using PCA on the Word2Vec model vectors, we can now visualize the test-set songs (each dot represents a song, obtained by averaging its word vectors). Let’s check how close the new songs are to the two playlist centroids:

Image by author

With this representation, we can also see whether new songs lie very close to a playlist centroid, which makes the classification more robust. In fact, we can see why one song was misclassified: it is much closer to the “wrong” playlist centroid.

This article presents a possible strategy to assign new songs to existing playlists based on their lyrics.

By using word2vec for lyrics embedding and logistic regression for classification, very good results were achieved.

In order to generalise this strategy, different embedding techniques and different regression models could be compared, ideally using a much larger dataset, which normally improves the word embedding task.

Thank you for reading!

