The-Netflix-prize-data

Analysis of the dataset that contains the evaluations attributed by Netflix users to a selection of films.

The goal will be to analyze the sparse matrix using some collaborative filtering methods. Implementation of methods such as: associative rules, more precisely through the Apriori algorithm, latent Factor models, paying attention to gradient descent and SVD.

To analize the dataset, a data sample was selected due to the large size of the dataset.

After creating the rating matrix, i.e. a sparse matrix with the user ID as row index, the movie ID as column index and as internal values the ratings for each pair {user_i, film_j}, this matrix has been reconstructed through some collaborative filtering methods.

The different methods mentioned above were developed in an iterative way, considering a number of latent factors equal to 5.

Among the different methods applied to the dataset, the traditional gradient descent algorithm turns out to be the most efficient in terms of time needed to reach convergence.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
GD.py		GD.py
Inizializzazione_per_Gradiente.py		Inizializzazione_per_Gradiente.py
README.md		README.md
READ_ME.txt		READ_ME.txt
SGD.py		SGD.py
SVD.py		SVD.py
campione_per_idUtente.py		campione_per_idUtente.py
creazione_dataset.py		creazione_dataset.py
inizializzazione_matrice_SVD.py		inizializzazione_matrice_SVD.py
main.py		main.py
matrice_sparsa.py		matrice_sparsa.py
regole_associative.py		regole_associative.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The-Netflix-prize-data

About

Releases

Packages

Languages

federicapicogna/The-Netflix-prize-data

Folders and files

Latest commit

History

Repository files navigation

The-Netflix-prize-data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages