Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement percentile computation from histogram data #1416

Merged
merged 36 commits into from
Oct 16, 2023

Conversation

ttnghia
Copy link
Collaborator

@ttnghia ttnghia commented Sep 16, 2023

This implements the last evaluation step to support distributed percentile aggregation. The first (update) and second (merge) steps are implemented through HISTOGRAM and MERGE_HISTOGRAM aggregations in cudf (rapidsai/cudf#14045).

Depends on:

Status:

  • Working prototypes.
  • Integration tests in spark-rapids integration tests (reduction + groupby) passed.
  • Clean up and docs.

@ttnghia ttnghia self-assigned this Sep 16, 2023
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
@ttnghia ttnghia marked this pull request as ready for review September 30, 2023 04:04
@ttnghia
Copy link
Collaborator Author

ttnghia commented Sep 30, 2023

build

Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
Signed-off-by: Nghia Truong <[email protected]>
@ttnghia
Copy link
Collaborator Author

ttnghia commented Oct 1, 2023

build

Signed-off-by: Nghia Truong <[email protected]>
@ttnghia
Copy link
Collaborator Author

ttnghia commented Oct 2, 2023

build

Signed-off-by: Nghia Truong <[email protected]>
@ttnghia ttnghia marked this pull request as ready for review October 6, 2023 20:44
@ttnghia ttnghia changed the base branch from branch-23.10 to branch-23.12 October 11, 2023 15:59
Copy link
Collaborator

@revans2 revans2 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The java Side of things looks good, but I don't feel confident really reviewing the C++ code. I could follow what was being done and it looks good, but I am no expert here.

Copy link
Collaborator

@hyperbolic2346 hyperbolic2346 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

First pass looks good. Some questions.

src/main/cpp/src/histogram.cu Show resolved Hide resolved
src/main/cpp/src/histogram.cu Outdated Show resolved Hide resolved
src/main/cpp/src/histogram.cu Show resolved Hide resolved
src/main/cpp/src/histogram.cu Show resolved Hide resolved
@ttnghia
Copy link
Collaborator Author

ttnghia commented Oct 14, 2023

build

@ttnghia ttnghia merged commit 150bafc into NVIDIA:branch-23.12 Oct 16, 2023
1 check passed
@ttnghia ttnghia deleted the percentile branch October 16, 2023 20:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants