aws-glue-crawler

Here are 35 public repositories matching this topic...

This project builds a data pipeline for tracking and analyzing sales performance with AWS services. The process involves creating a DynamoDB database, implementing Change Data Capture (CDC), streaming the changes through Kinesis, and finally making the data queryable in Amazon Athena.

  • Updated Feb 11, 2024
  • Python

An end-to-end data engineering project on real-time stock market data using Kafka. It uses Python, Amazon Web Services (AWS), Apache Kafka, Glue, Athena, and SQL.

  • Updated May 23, 2024
  • Jupyter Notebook

An ETL pipeline for real-time ingestion of stock market data from the stock-market-data-manage.onrender.com API. The system stores data in Parquet format for efficient querying and runs data quality checks to verify accuracy before visualization.

  • Updated Jun 25, 2024
  • Python
