Commit
Implement CardinalityAwareRowConverter while doing streaming merge (#7401)

* Use CardinalityAwareRow converter
* Move CardinalityAwareRowConverter in df
* Try to add cardinality aware row converter in df
* Move CardinalityAwareRowConverter in df
* Move CardinalityAwareRowConverter in df
* Remove unnecessary clone and make wrapper mod private
* Use as_any_dictionary_opt
* Remove unnecessary comments
* Remove done
* Add test for cardinality aware row converter on high card dict
* Add test for cardinality aware row converter on low card dict
* Ignore the test_dict_merge_infinite test
* Remove phantom Arc import
* Remove the infinite stream test
* Update datafusion/core/src/physical_plan/wrapper.rs
  Co-authored-by: Andrew Lamb <[email protected]>
* Update convert_rows signature and add empty_rows
* Add comments to the test
* Use Some and take() semantics
* Init with a row converter instance instead of none
* Remove unused variable
* Remove unused imports
* Remove unused imports
* Change GroupValues
* Add comments, run fmt
* Init with a empty row converter
* Use the cardinality aware row converter
* Reconvert the group values
* Rename wrapper to row_converter
* Recovert the group values
* Convert back to dictionary
* fmt
* A fmt pass
* fix: fmt
* Move the reconversion to dict to just consider group by columns
* Reconvert only the correct cols
* Use assert eq
* clippy
* clippy
* Add comment about the reconversion to dict
* Fix the merge issues
* move data type conversion
* fix
* fix docs
* fix bug
* Improve tests
* simplify
* Use cardinality aware row converter in gby order
* clippy
* Adjust memory test
* Add doc comments about row converter
* remove outdated comment
* Rework partition size calculation to make test clearer
* Increase threshold to 512
* Update row converter tests according to new threshold
* fix clippy
* fix panic
* Adjust constant for test

---------

Co-authored-by: Andrew Lamb <[email protected]>
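The commit message refers to a cardinality threshold (raised to 512), to dictionary columns of high and low cardinality, and to converting group values back to dictionary form. As a rough illustration of that idea only, and not the code from this commit, here is a std-only Rust sketch. Everything in it is hypothetical: `DictColumn`, `Prepared`, `prepare`, `to_dictionary`, and `CARDINALITY_THRESHOLD` are made-up names, the column type is a toy stand-in for an Arrow dictionary array, and whether the real comparison against the threshold is strict or inclusive is an assumption.

```rust
use std::collections::HashMap;

/// Hypothetical cutoff. The commit message only says the threshold
/// was increased to 512; how it is compared is an assumption here.
const CARDINALITY_THRESHOLD: usize = 512;

/// Toy dictionary-encoded string column: `keys[i]` indexes into `values`.
/// A stand-in for an Arrow dictionary array, not the real type.
#[derive(Debug, Clone, PartialEq)]
struct DictColumn {
    values: Vec<String>,
    keys: Vec<usize>,
}

/// What would be handed to an (imaginary) row encoder.
#[derive(Debug, PartialEq)]
enum Prepared {
    /// Low cardinality: keep the dictionary encoding.
    Dictionary(DictColumn),
    /// High cardinality: materialize the values as a plain column.
    Plain(Vec<String>),
}

/// Decide per column whether to keep the dictionary or expand it,
/// based on the number of distinct dictionary values.
fn prepare(col: DictColumn, threshold: usize) -> Prepared {
    if col.values.len() > threshold {
        let plain: Vec<String> =
            col.keys.iter().map(|&k| col.values[k].clone()).collect();
        Prepared::Plain(plain)
    } else {
        Prepared::Dictionary(col)
    }
}

/// Re-encode a plain column as a dictionary, assigning keys in order of
/// first appearance (the "convert back to dictionary" step, in toy form).
fn to_dictionary(plain: &[String]) -> DictColumn {
    let mut index: HashMap<&str, usize> = HashMap::new();
    let mut values: Vec<String> = Vec::new();
    let mut keys: Vec<usize> = Vec::with_capacity(plain.len());
    for s in plain {
        let next = values.len();
        let key = *index.entry(s.as_str()).or_insert_with(|| {
            values.push(s.clone());
            next
        });
        keys.push(key);
    }
    DictColumn { values, keys }
}
```

Calling `prepare(col, CARDINALITY_THRESHOLD)` on a column with only a few distinct values returns it unchanged as `Prepared::Dictionary`, while a column whose dictionary exceeds the threshold comes back as `Prepared::Plain`; `to_dictionary` can then restore a dictionary-typed column for output.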
1 parent 44b3318, commit f4c4ee1
Showing 9 changed files with 461 additions and 69 deletions.