Ability to decompress snappy and zstd Parquet files via CPU [databricks] #11752

Merged
merged 9 commits into from
Nov 26, 2024

Conversation

jlowe
Member

@jlowe jlowe commented Nov 22, 2024

Adds the ability to decompress snappy-compressed and zstd-compressed Parquet files via the CPU. The Parquet data is still sent to the GPU for decoding (e.g., run-length decode, dictionary decode, etc.).

The feature is disabled by default and has the following limitations when enabled:

  • filecache will not be leveraged for data blocks being decompressed by the CPU
  • Iceberg reads are not supported (will still decompress on the GPU as before)
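
As a rough illustration of the behavior described above, the sketch below models where a compressed Parquet data block would be decompressed. It is not code from this PR or from the plugin: `Codec`, `ReadContext`, `decompress_location`, and `may_use_filecache` are hypothetical names invented for this example, and the logic only restates the description (off by default, snappy and zstd only, Iceberg reads stay on the GPU, no filecache for CPU-decompressed blocks).

```python
from dataclasses import dataclass
from enum import Enum


class Codec(Enum):
    UNCOMPRESSED = "uncompressed"
    SNAPPY = "snappy"
    ZSTD = "zstd"
    GZIP = "gzip"


# Codecs the PR description names as eligible for CPU decompression.
CPU_CODECS = {Codec.SNAPPY, Codec.ZSTD}


@dataclass(frozen=True)
class ReadContext:
    # Hypothetical flags; the feature is described as disabled by default.
    cpu_decompress_enabled: bool = False
    is_iceberg: bool = False


def decompress_location(codec: Codec, ctx: ReadContext) -> str:
    """Return "none", "cpu", or "gpu" for a data block with the given codec."""
    if codec is Codec.UNCOMPRESSED:
        return "none"
    if ctx.cpu_decompress_enabled and not ctx.is_iceberg and codec in CPU_CODECS:
        return "cpu"
    # Everything else keeps the previous behavior: decompress on the GPU.
    return "gpu"


def may_use_filecache(codec: Codec, ctx: ReadContext) -> bool:
    """Blocks decompressed by the CPU do not go through the filecache."""
    return decompress_location(codec, ctx) != "cpu"
```

In every case the decoding step (run-length decode, dictionary decode, and so on) still happens on the GPU; only the decompression step moves.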

@jlowe jlowe self-assigned this Nov 22, 2024
@jlowe
Member Author

jlowe commented Nov 22, 2024

build

@jlowe
Member Author

jlowe commented Nov 22, 2024

build

@sameerz sameerz added the feature request New feature or request label Nov 25, 2024
@jlowe jlowe changed the title Ability to decompress snappy and zstd Parquet files via CPU Ability to decompress snappy and zstd Parquet files via CPU [databricks] Nov 25, 2024
@jlowe
Member Author

jlowe commented Nov 25, 2024

build

1 similar comment
@revans2
Collaborator

revans2 commented Nov 25, 2024

build

@revans2
Collaborator

revans2 commented Nov 25, 2024

Not sure what happened with the Databricks build: "ERROR: Couldn't find any revision to build. Verify the repository and branch configuration for this job."

@jlowe
Member Author

jlowe commented Nov 26, 2024

build

@jlowe jlowe merged commit ff0ca0f into NVIDIA:branch-24.12 Nov 26, 2024
49 checks passed
@jlowe jlowe deleted the parquet-cpu-decompress branch November 26, 2024 21:07
Labels
feature request New feature or request
3 participants