MapReduce, Pig Latin, Hive, and Spark scripts, written against the IMDB dataset of 100k records.
The assignments consist of scripts that perform distributed data storage and distributed data processing on the IMDB dataset to extract ratings insights, such as the highest- and lowest-rated movies. The Spark scripts also use a built-in recommendation library to make movie predictions.
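
As a rough illustration of the highest- and lowest-rated query, the sketch below mimics the map and reduce phases in plain Python on a single machine. It is not one of the assignment scripts: the record layout `(user_id, movie_id, rating)`, the sample data, and the function names `map_phase`, `reduce_phase`, and `rating_extremes` are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical sample records in an assumed (user_id, movie_id, rating) layout.
SAMPLE_RATINGS = [
    (1, "A", 5), (2, "A", 4),
    (1, "B", 1), (3, "B", 2),
    (2, "C", 3),
]


def map_phase(records):
    """Map step: emit one (movie_id, rating) pair per input record."""
    for _user_id, movie_id, rating in records:
        yield movie_id, rating


def reduce_phase(pairs):
    """Reduce step: sum ratings per movie, then divide by the count."""
    totals = defaultdict(lambda: [0, 0])  # movie_id -> [rating_sum, rating_count]
    for movie_id, rating in pairs:
        totals[movie_id][0] += rating
        totals[movie_id][1] += 1
    return {movie: total / count for movie, (total, count) in totals.items()}


def rating_extremes(records):
    """Return (highest_rated_movie, lowest_rated_movie, averages_by_movie)."""
    averages = reduce_phase(map_phase(records))
    highest = max(averages, key=averages.get)
    lowest = min(averages, key=averages.get)
    return highest, lowest, averages


highest, lowest, averages = rating_extremes(SAMPLE_RATINGS)
```

On a real cluster the grouping between the two phases would be handled by the framework's shuffle. A production query would likely also filter out movies with very few ratings, since a single 5-star vote would otherwise top the list.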