-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* docs: update README * feat: update paths and step preprocess * feat: update init_train * feat: update main and query
- Loading branch information
Showing
12 changed files
with
95 additions
and
60 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,10 +1,31 @@ | ||
# Blueflix Streamlit | ||
|
||
The idea of starting a project on [Streamlit](https://streamlit.io) came from wanting to study a new tool. With ease of streamlit provides for the development of projects is incredible. I had done a project in college of a jupyter notebook being for movie recommendations, after that I continued with [Kaggle](https://www.kaggle.com/code/sc0v1n0/k-means-recommend-movies-and-tv-shows-netflix). Wanting to learn Streamlit why not combine the two projects. So I developed this project to apply this knowledge. I hope to write an article about the development. For now I'm still in development, for new updates. | ||
When I was in college I had developed a project with jupyter notebook, which consumes data from the [Netflix Prime Video Movies and TV Shows](https://www.kaggle.com/datasets/shivamb/netflix-shows) set. The idea was to use this set of data, clean, analyze and develop a stage where I could recommend movies and TV shows. | ||
|
||
## Datasets: | ||
I was very happy with the result. But I wanted more, I wanted to take this notebook and transfer it to an application where I could interact with the project. So create a personal project where I can use what I studied and learned over time. | ||
|
||
- [Disney+ Movies and TV Shows](https://www.kaggle.com/datasets/shivamb/disney-movies-and-tv-shows) | ||
But something was missing, which was how am I going to show this result of my project. During that time I discovered this tool [Streamlit](https://streamlit.io), ohhhhhhhhhh!!!!! Incredible !!! The flexibility I gained using it was very good and in addition to being able to deploy using their platform, this way I can show what I did. | ||
|
||
I want to thank **Kaggle - @shivamb**, for making the sets below available. In addition to the Netflix set, there are 3 more. | ||
|
||
- [Netflix Movies and TV Shows](https://www.kaggle.com/datasets/shivamb/netflix-shows) | ||
- [Hulu Movies and TV Shows](https://www.kaggle.com/datasets/shivamb/hulu-movies-and-tv-shows) | ||
- [Disney+ Movies and TV Shows](https://www.kaggle.com/datasets/shivamb/disney-movies-and-tv-shows) | ||
- [Amazon Prime Movies and TV Shows](https://www.kaggle.com/datasets/shivamb/amazon-prime-movies-and-tv-shows) | ||
|
||
From these 4 sets, the idea of creating a single one came up to be able to expand the data further, to be able to create more recommendations. Follow the link below. | ||
|
||
[4 Services Streaming Movies and Tv Shows](https://www.kaggle.com/datasets/sc0v1n0/4-services-streaming-movies-and-tv) | ||
|
||
If you want to understand the process more, I have a post and 4 more notebooks where I explain the notebook I created. | ||
|
||
- [Post - K-Means Recommend Movies and Tv Shows ](https://dev.to/sc0v0ne/k-means-recommend-movies-and-tv-shows-156m) | ||
- [Hulu Notebook](https://www.kaggle.com/code/sc0v1n0/k-means-recommend-movies-and-tv-shows-hulu) | ||
- [Amazon Notebook](https://www.kaggle.com/code/sc0v1n0/k-means-recommend-movies-and-tv-shows-amazon-prime) | ||
- [Disney Notebook](https://www.kaggle.com/code/sc0v1n0/k-means-recommend-movies-and-tv-shows-disney) | ||
- [Notebook Netflix](https://www.kaggle.com/code/sc0v1n0/k-means-recommend-movies-and-tv-shows-netflix) | ||
|
||
|
||
## Conclusion | ||
|
||
This personal project is a dream that I am developing, I want to evolve it further with the skills I acquire along the way. I won't always be adding updates, because I have ideas of other projects that I want to evolve, but I won't stop paying attention. I hope that other developers understand my codes and that I can transfer what I learned in this time. I hope you enjoyed it. Please, if you could leave a like on my post or on my notebooks, I would really appreciate it, so I can know if you liked it. Thank you for reading this far. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,15 +1,15 @@ | ||
from preprocess import preprocess | ||
from clusters import init_train | ||
import sys | ||
import os | ||
|
||
|
||
if __name__ == '__main__': | ||
|
||
NAME_INPUT_DIR = sys.argv[1] | ||
|
||
preprocess(NAME_INPUT_DIR) | ||
|
||
PATH_PROCESSED = os.path.join('/preprocess') | ||
|
||
DATA_TRAIN = 'train_gender.csv' | ||
DATA_MOVIES_SERIES = 'data_titles_processed.csv' | ||
|
||
init_train(PATH_PROCESSED, DATA_TRAIN, DATA_MOVIES_SERIES) | ||
init_train(DATA_TRAIN, DATA_MOVIES_SERIES) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,3 @@ | ||
#!/bin/bash | ||
docker build . -f ./containers/Dockerfile.streamlit -t streamlit_app:latest --rm | ||
docker build . -f ./containers/Dockerfile.streamlit \ | ||
-t streamlit_app:latest --rm |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters