Skip to content

Nandan9911/Hadoop-Projects

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Hadoop project - 1 (Q1,Q2,Q3)

MapReduce project on Shakespeare data

DATA SET: https://de-mapreduce-gutenberg.s3.amazonaws.com/100-0.txt

  1. Which word has the highest frequency of occurrence in the document?
  2. What is the frequency of occurrence of the word ‘Romeo’? (Ignore cases and don't remove punctuation marks from any words.)
  3. What is the frequency of the phrase "circumference." in the data set? (You do not need to remove the punctuation marks from the words.)

MapReduce on Airports data

Data Set: https://drive.google.com/file/d/1DpfofGJbMeB4ZIh6wM54nYoeHQUjH3uL/view?usp=share_link

  1. Count the number of unique ordered pairs of origin and destination (Origin, Destination) present in the dataset, i.e., for two flights, either the origin or the destination differs.
  2. What is the airport code and the number of flights corresponding to that airport, with the maximum number of outgoing flights in the year 2004?

MapReduce on Cricket data

DATA SET: https://drive.google.com/file/d/10TLQxUn1ndkUcfeHRNVcB_JZtd_c-mxw/view?usp=share_link

  1. Which player scored the highest number of centuries?
  2. In which year did Indian players score the maximum number of centuries?

Hadoop Project - 2

Here, we have chosen the stock market dataset (NYSE.csv) on which we have performed map-reduce operations. Following is the structure of the data. Kindly find the solutions to the questions below.

Data Structure

  1. Exchange Name 2 Stock symbol
  2. Transaction date
  3. Opening price of the stock
  4. Intra day high price of the stock
  5. Intra day low price of the stock
  6. Closing price of the stock
  7. Total Volume of the stock on the particular day
  8. Adjustment Closing price of the stock Field Separator – comma

Find all time High price for each stock

About

Minor projects done on Hadoop

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages