Diversity Scoring on MovieLens

Overview

In this repository we implement Spotify's Generalist-Specialist score on the MovieLens dataset.

The code here accompanies the Medium article "Different Strokes for Different Folks at Spotify".
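For readers who want a feel for the score before opening the notebook, here is a minimal sketch of a GS computation. It assumes the usual formulation: the weighted mean cosine similarity between each item embedding and the user's weighted centroid. The function name gs_score, the plain-list vector representation, and the choice of weights (for example, ratings) are illustrative assumptions and not necessarily what the notebook uses.

```python
import math
from typing import Sequence


def gs_score(item_vectors: Sequence[Sequence[float]],
             weights: Sequence[float]) -> float:
    """Weighted mean cosine similarity between items and the user's centroid.

    item_vectors: one embedding per item the user interacted with.
    weights: one non-negative weight per item (assumed here, e.g. a rating).
    """
    if not item_vectors or len(item_vectors) != len(weights):
        raise ValueError("need one weight per item and at least one item")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    # Weighted centroid of the user's item embeddings.
    dim = len(item_vectors[0])
    centroid = [
        sum(w * v[d] for v, w in zip(item_vectors, weights)) / total
        for d in range(dim)
    ]
    centroid_norm = math.sqrt(sum(c * c for c in centroid))
    if centroid_norm == 0:
        raise ValueError("centroid has zero norm; cosine is undefined")

    # Weighted average of cosine(item, centroid).
    score = 0.0
    for v, w in zip(item_vectors, weights):
        v_norm = math.sqrt(sum(x * x for x in v))
        if v_norm == 0:
            raise ValueError("item vector has zero norm")
        dot = sum(x * c for x, c in zip(v, centroid))
        score += w * dot / (v_norm * centroid_norm)
    return score / total
```

Under this definition a user with a single item always scores 1, since the centroid coincides with that item's vector. Scores near 1 indicate a specialist and lower scores a generalist.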

Installation

We recommend using Anaconda to create a virtual environment. Install the requirements via

conda install --yes --file requirements.txt

or, if using pip, run

pip install -r requirements.txt

Running the Notebook

To run the notebook, start a Jupyter session by running jupyter notebook from the repository root, then open "Diversity Scoring on the MovieLens Dataset.ipynb". The notebook should download the MovieLens dataset and unzip it in the current directory.
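If the automatic download fails, the same step can be done by hand. The following is a rough sketch of a download-and-unzip helper using only the standard library. The helper name download_and_extract is hypothetical, and the URL constant is an assumption: this README does not say which MovieLens variant the notebook fetches.

```python
import shutil
import urllib.parse
import urllib.request
import zipfile
from pathlib import Path
from typing import List

# Assumption: the small MovieLens release hosted by GroupLens.
# The notebook may use a different variant.
MOVIELENS_URL = "https://files.grouplens.org/datasets/movielens/ml-latest-small.zip"


def download_and_extract(url: str, dest_dir: str = ".") -> List[str]:
    """Download a zip archive into dest_dir and extract it there.

    Returns the list of member names contained in the archive.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)

    # Name the local archive after the last path component of the URL.
    archive = dest / Path(urllib.parse.urlparse(url).path).name

    # Skip the download if the archive is already present.
    if not archive.exists():
        with urllib.request.urlopen(url) as response, open(archive, "wb") as out:
            shutil.copyfileobj(response, out)

    with zipfile.ZipFile(archive) as zf:
        zf.extractall(dest)
        return zf.namelist()
```

Calling download_and_extract(MOVIELENS_URL) would place the archive and its extracted contents in the current directory, which is where the notebook expects to find the data.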

Warning!

Training the model and computing the generalist-specialist (GS) scores for all users in the MovieLens dataset can be time-consuming.

Results

The distributions of the Shannon entropy and GS scores across MovieLens users are shown here:

As seen from the histogram above, the majority of users have watched and rated a wide range of movies. The spike at the 1.0 bin is mainly due to users who have watched and rated only a single movie. Among users who watched and rated more than one movie, 857 have a GS score above 0.90 under our trained Word2Vec model.
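For comparison with the GS score, the Shannon entropy of a user's history can be sketched as below. This assumes the entropy is taken over the categories of the movies a user rated (genres, for instance). The function name shannon_entropy and the choice of categories are illustrative assumptions, not a description of the notebook's exact code.

```python
import math
from collections import Counter
from typing import Hashable, Iterable


def shannon_entropy(labels: Iterable[Hashable], base: float = 2.0) -> float:
    """Shannon entropy of the empirical distribution of labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("need at least one label")
    # H = sum over categories of p * log(1 / p), with p = count / total.
    return sum(
        (c / total) * math.log(total / c, base) for c in counts.values()
    )
```

A user whose ratings all fall in one category has an entropy of zero, while a user spread evenly across many categories has a high entropy. Unlike the GS score, this measure ignores how similar the categories are to one another.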
