Skip to content

Latest commit

 

History

History
8 lines (6 loc) · 399 Bytes

README.md

File metadata and controls

8 lines (6 loc) · 399 Bytes

Final Project PySpark (HPPL-21)

The repository contains three notebooks of practice:

  1. General introduction to PySpark syntaxes for ETL data processing
  2. Example of accomplishing ML related task (regression problem) using intrinsic PySpark modules.
  3. Example of parallelization of Pi number calculation in PySpark

HWs and Projects on High Performance Python Lab (HPPL `21) course at Skoltech