Developed a recommendation system that implemented collaborative filtering on an Amazon dataset of 500,000+ rows.
Implemented Spark and Spark ML to extract, clean and pre-process dataset in order to fit it on Alternating Least Squares machine learning model, to provide product recommendations based on product and review data.
Achieved a Root Mean Square Error value of 0.5763, implying that model is accurate