Skip to content

Latest commit

 

History

History
34 lines (23 loc) · 918 Bytes

README.md

File metadata and controls

34 lines (23 loc) · 918 Bytes

Python ETL

Minimal examples of ETL scripts/flows/components in Python. All implementations use the same input data and data transformation. Starting point and lookup source for anything date-engineer related.

Requirments

  • docker
  • make

Usage

make [implementation]

List of Implementations

Example

make spark

Will run a etl job through a containzerized spark cluster and put the result on minio. After the etl job is finished running on spark, the result can be seen in the minio console at http://127.0.0.1:8060/buckets/example/browse (user: dev & password: dev_pass)

Inspiration

Inspired by algorithms, and intended to become the etl scripts/flows/components equivalent.