Skip to content

A big data project using pyspark and hadoop performing on a nuclear power plant database.

Notifications You must be signed in to change notification settings

TJSarno/Third-Year-Big-Data

Repository files navigation

Third-Year-Big-Data

A big data project using pyspark and hadoop performing on a nuclear power plant database to produce the following:

  • Data Validation
  • Data Cleansing
  • Box plots
  • Heat maps
  • Decision Trees
  • ANN
  • Linear Support Vector
  • Mapping/Reduce

About

A big data project using pyspark and hadoop performing on a nuclear power plant database.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages