Spotify Social Network Analysis

Required libraries

The required libraries are specified in the requirements.txt file. You need to install:

pandas
scikit-learn
plotly
matplotlib
seaborn
networkx

Data

If you want to skip the scraping stage, which in total takes cca. 17h, I have published the already scraped data and graph files:

https://drive.google.com/file/d/1E9itO0aoc-UHOPRXewhRZnHuxrQli-Ol/view?usp=sharing

Download the file and unzip it in the folder "data/". You will also get the already generated GEXF and graphML graph files.

Scraping takes a long time due to the Spotify API limits. This limits are not clearly specified and we have to err on the side of caution to be able to continue to scrape.

Scraping times for 10000 artists:

Artists: cca. 3h
Top 10 artist tracks and their audio features: cca. 6h
Writers of the top 10 artist tracks: cca. 23h

Due to the large amount of time it takes to scrape the top 10 song writers we limited the scraping to only top 3 songs per artist which brings the scraping time for writers down to cca. 8h.

Scraping instructions

Edit main.py and set (at the top of the file) your CLIENT_ID and CLIENT_SECRET from Spotify API. In addition, you will have to collect your "Bearer token" from your browser when you login to https://open.spotify.com/ and also save it at the top of the file in the TOKEN variable.

Data analysis and graph generation

Once you have the scraped files in data/raw/ folder you can open the Jupyter Notebook notebooks/scraped_data_analysis.ipynb and run it to load the data, clean it and generate GEXF graph files.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
data		data
notebooks		notebooks
scraper		scraper
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Spotify Social Network Analysis

Required libraries

Data

Scraping instructions

Data analysis and graph generation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

a-coding-Kat/SNA

Folders and files

Latest commit

History

Repository files navigation

Spotify Social Network Analysis

Required libraries

Data

Scraping instructions

Data analysis and graph generation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages