The Data Engineering Zoomcamp is a free 9-week course that teaches the fundamentals of building data pipelines.
I got hands-on experience with tools like Docker, Terraform, Kestra, dbt, Spark, and Kafka, learning about everything from setting up infrastructure to working with streaming data.
The final project really helped solidify what I learned and gave me a chance to apply it all.
👩🏽‍💻 Link to the course
Introduction to GCP; Docker and Docker Compose; Running PostgreSQL with Docker; Infrastructure setup with Terraform
Data Lakes and Workflow Orchestration; Workflow orchestration with Kestra
API reading and pipeline scalability; Data normalization and incremental loading
Introduction to BigQuery; Partitioning, clustering, and best practices; Machine learning in BigQuery
dbt (data build tool) with PostgreSQL & BigQuery; Testing, documentation, and deployment; Data visualization with Metabase
Introduction to Apache Spark; DataFrames and SQL; Internals of GroupBy and Joins
Introduction to Kafka; Kafka Streams and KSQL; Schema management with Avro