Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compressed jsons uploads and downloads #209

Merged
merged 3 commits into from
May 3, 2021
Merged

Compressed jsons uploads and downloads #209

merged 3 commits into from
May 3, 2021

Conversation

alok87
Copy link
Contributor

@alok87 alok87 commented Apr 30, 2021

Why?

Improve loader performance #195 (comment)

Experimenting with compressions to see the improvement. 10x diff

  • Batcher uploads the data to s3 as gz compressed jsons.
10k lines in file, uploads in: 300ms(compressed), 3s(uncompressed)
10k lines in file, uploads in: 3s(compressed), 30s(uncompressed)
  • Loader should make the COPY happen using gzip uncompression.

@alok87 alok87 requested a review from justjkk April 30, 2021 10:21
@alok87 alok87 merged commit 6a93c2b into master May 3, 2021
@alok87 alok87 deleted the compression branch May 31, 2021 07:20
alok87 added a commit that referenced this pull request Jun 5, 2021
Compressed jsons uploads and downloads
alok87 added a commit that referenced this pull request Jun 7, 2021
Compressed jsons uploads and downloads
alok87 added a commit that referenced this pull request Jun 17, 2021
Compressed jsons uploads and downloads
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant