Skip to content

Latest commit

 

History

History
11 lines (10 loc) · 261 Bytes

README.md

File metadata and controls

11 lines (10 loc) · 261 Bytes

Third-Year-Big-Data

A big data project using pyspark and hadoop performing on a nuclear power plant database to produce the following:

  • Data Validation
  • Data Cleansing
  • Box plots
  • Heat maps
  • Decision Trees
  • ANN
  • Linear Support Vector
  • Mapping/Reduce