
Remove PartiallyOrdered handling from BoundedWindowAggExec #11

Open · wants to merge 24 commits into apache_main
Conversation

mustafasrepo (Collaborator)

Which issue does this PR close?

Closes #.

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?


mustafasrepo commented Mar 15, 2024

According to my comparison, this behaviour harms performance. I compared the following plans:

```
"AggregateExec: mode=Partial, gby=[a@0 as a, b@1 as b, d@3 as d], aggr=[sum1], ordering_mode=Sorted",
"  PartialSortExec: expr=[a@0 ASC,b@1 ASC,d@3 ASC], common_prefix_length=[2]",
"    MemoryExec: partitions=1, partition_sizes=[61], output_ordering=a@0 ASC,b@1 ASC,c@2 ASC",
```

and

```
"AggregateExec: mode=Partial, gby=[a@0 as a, b@1 as b, d@3 as d], aggr=[sum1], ordering_mode=PartiallySorted([0, 1])",
"  MemoryExec: partitions=1, partition_sizes=[61], output_ordering=a@0 ASC,b@1 ASC,c@2 ASC",
```

According to the tests, the second version performs better. Hence, this change is not beneficial for AggregateExec in terms of performance.
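For background, PartialSortExec exploits an existing ordering on a prefix of the requested sort keys: rows sharing the same prefix values arrive as contiguous runs, so each run only needs to be sorted on the remaining keys. A minimal Python sketch of that idea (the `partial_sort` helper and its signature are illustrative, not a DataFusion API):

```python
from itertools import groupby

def partial_sort(rows, sort_keys, prefix_len):
    """Sort `rows` on `sort_keys`, assuming they already arrive sorted
    on the first `prefix_len` of those keys (the "common prefix").

    Rows sharing the same prefix form a contiguous run, so each run is
    sorted independently on the remaining keys; a streaming operator
    can emit a run as soon as the prefix value advances."""
    prefix = lambda r: tuple(r[k] for k in sort_keys[:prefix_len])
    suffix = lambda r: tuple(r[k] for k in sort_keys[prefix_len:])
    out = []
    for _, run in groupby(rows, key=prefix):
        out.extend(sorted(run, key=suffix))
    return out

# Input is sorted on (a, b); the target ordering is (a, b, d), so only
# `d` needs sorting inside each (a, b) run.
rows = [
    {"a": 0, "b": 0, "d": 2},
    {"a": 0, "b": 0, "d": 1},
    {"a": 0, "b": 1, "d": 5},
    {"a": 1, "b": 0, "d": 3},
    {"a": 1, "b": 0, "d": 0},
]
result = partial_sort(rows, ["a", "b", "d"], prefix_len=2)
```

The per-run sort is still real work, which is consistent with the measurement above: when the aggregate can consume the PartiallySorted input directly, skipping the extra sort wins.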

I ran another set of tests with window functions. It seems that this change is beneficial there: the following plan

```
"BoundedWindowAggExec: wdw=[count(x) PB([\"a\", \"c\"]), OB:[\"b\"]: Ok(Field { name: \"count(x) PB([\\\"a\\\", \\\"c\\\"]), OB:[\\\"b\\\"]\", data_type: Int64, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }), frame: WindowFrame { units: Rows, start_bound: Preceding(UInt64(NULL)), end_bound: CurrentRow, is_causal: true }], mode=[Sorted]",
"  PartialSortExec: expr=[a@0 ASC,c@2 ASC,b@1 ASC], common_prefix_length=[1]",
"    MemoryExec: partitions=1, partition_sizes=[80], output_ordering=a@0 ASC,b@1 ASC,c@2 ASC",
```

executes faster than

```
"BoundedWindowAggExec: wdw=[count(x) PB([\"a\", \"c\"]), OB:[\"b\"]: Ok(Field { name: \"count(x) PB([\\\"a\\\", \\\"c\\\"]), OB:[\\\"b\\\"]\", data_type: Int64, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }), frame: WindowFrame { units: Rows, start_bound: Preceding(UInt64(NULL)), end_bound: CurrentRow, is_causal: true }], mode=[PartiallySorted([0])]",
"  MemoryExec: partitions=1, partition_sizes=[80], output_ordering=a@0 ASC,b@1 ASC,c@2 ASC",
```
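As a rough mental model of the two modes (an illustrative sketch, not the actual BoundedWindowAggExec code): in Sorted mode each partition arrives contiguously, so only one partition's state is live at a time; in PartiallySorted([0]) mode only `a` out of the PARTITION BY (`a`, `c`) keys is ordered, so every partition sharing the current `a` value stays live until `a` advances. For the causal running-count frame above, the two strategies look roughly like this in Python:

```python
def count_sorted(rows, part_keys):
    """Running COUNT per partition when input is fully sorted on
    part_keys: at most one partition is live at a time."""
    out, current, count = [], None, 0
    for r in rows:
        key = tuple(r[k] for k in part_keys)
        if key != current:
            current, count = key, 0  # previous partition is finished
        count += 1
        out.append(count)
    return out

def count_partially_sorted(rows, part_keys, prefix_len):
    """Same computation when only the first prefix_len partition keys
    are ordered: all partitions under the current prefix stay live,
    and their state is dropped only when the prefix advances."""
    out, prefix_val, live = [], None, {}
    for r in rows:
        pv = tuple(r[k] for k in part_keys[:prefix_len])
        if pv != prefix_val:
            prefix_val, live = pv, {}  # prefix advanced: drop all state
        key = tuple(r[k] for k in part_keys)
        live[key] = live.get(key, 0) + 1
        out.append(live[key])
    return out

# Fully sorted on (a, c) vs. sorted on `a` only with `c` interleaved.
rows_sorted = [{"a": 0, "c": 0}, {"a": 0, "c": 0}, {"a": 0, "c": 1}, {"a": 1, "c": 0}]
rows_partial = [{"a": 0, "c": 1}, {"a": 0, "c": 0}, {"a": 0, "c": 1}, {"a": 1, "c": 0}]
```

The partially sorted variant must keep a hash map of live partitions, which is why it was worth measuring whether a PartialSortExec plus the simpler Sorted path comes out ahead.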

@mustafasrepo mustafasrepo changed the title Remove PartiallyOrdered handling from BoundedWindowAggExec and AggregateExec Remove PartiallyOrdered handling from BoundedWindowAggExec Mar 18, 2024
```diff
 }
 
 fn is_mode_linear(&self) -> bool {
-    self.ordered_partition_by_indices.is_empty()
+    true
 }
```


Is this necessary?

```
@@ -2933,6 +2933,27 @@ LOCATION '../core/tests/data/window_2.csv';

# test_infinite_source_partition_by

query TT
EXPLAIN SELECT a, b, c,
```


We may want to check the performance of this test.

mustafasrepo (Collaborator, Author)

I compared the performance of this test with the following plan1:

```
"ProjectionExec: expr=[a@0 as a, b@1 as b, x@3 as x, SUM(source.c) PARTITION BY [source.a, source.x] ORDER BY [source.b ASC NULLS LAST, source.c ASC NULLS LAST] ROWS BETWEEN 2 PRECEDING AND 1 FOLLOWING@4 as sum1]",
"  BoundedWindowAggExec: wdw=[SUM(source.c) PARTITION BY [source.a, source.x] ORDER BY [source.b ASC NULLS LAST, source.c ASC NULLS LAST] ROWS BETWEEN 2 PRECEDING AND 1 FOLLOWING: Ok(Field { name: \"SUM(source.c) PARTITION BY [source.a, source.x] ORDER BY [source.b ASC NULLS LAST, source.c ASC NULLS LAST] ROWS BETWEEN 2 PRECEDING AND 1 FOLLOWING\", data_type: Int64, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }), frame: WindowFrame { units: Rows, start_bound: Preceding(UInt64(2)), end_bound: Following(UInt64(1)), is_causal: false }], mode=[PartiallySorted([0])]",
"    StreamingTableExec: partition_sizes=1, projection=[a, b, c, x], infinite_source=true, output_ordering=[a@0 ASC, b@1 ASC, c@2 ASC]",
```

and plan2:

```
"PartialSortExec: expr=[a@0 ASC NULLS LAST,b@1 ASC NULLS LAST], common_prefix_length=[1]",
"  ProjectionExec: expr=[a@0 as a, b@1 as b, x@3 as x, SUM(source.c) PARTITION BY [source.a, source.x] ORDER BY [source.b ASC NULLS LAST, source.c ASC NULLS LAST] ROWS BETWEEN 2 PRECEDING AND 1 FOLLOWING@4 as sum1]",
"    BoundedWindowAggExec: wdw=[SUM(source.c) PARTITION BY [source.a, source.x] ORDER BY [source.b ASC NULLS LAST, source.c ASC NULLS LAST] ROWS BETWEEN 2 PRECEDING AND 1 FOLLOWING: Ok(Field { name: \"SUM(source.c) PARTITION BY [source.a, source.x] ORDER BY [source.b ASC NULLS LAST, source.c ASC NULLS LAST] ROWS BETWEEN 2 PRECEDING AND 1 FOLLOWING\", data_type: Int64, nullable: true, dict_id: 0, dict_is_ordered: false, metadata: {} }), frame: WindowFrame { units: Rows, start_bound: Preceding(UInt64(2)), end_bound: Following(UInt64(1)), is_causal: false }], mode=[Sorted]",
"      PartialSortExec: expr=[a@0 ASC,x@3 ASC,b@1 ASC NULLS LAST,c@2 ASC NULLS LAST], common_prefix_length=[1]",
"        StreamingTableExec: partition_sizes=1, projection=[a, b, c, x], infinite_source=true, output_ordering=[a@0 ASC, b@1 ASC, c@2 ASC]",
```

The results are:

|       | single chunk | multi chunk |
|-------|--------------|-------------|
| plan1 | 41.968927ms  | 79.660012ms |
| plan2 | 39.103565ms  | 42.766507ms |

It seems that we benefit from the coalescing effect of the PartialSortExec in plan2. However, when the batch size is large, the benefit is marginal.
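The coalescing effect can be pictured with a toy cost model (the numbers and the `run_cost` helper are made up for illustration): every batch pays a fixed per-batch overhead on top of per-row work, so an operator like PartialSortExec that buffers rows and emits fewer, larger batches reduces the fixed overhead paid downstream.

```python
def run_cost(batches, per_batch_overhead=100, per_row_cost=1):
    """Toy model: each batch costs a fixed overhead plus per-row work.
    `batches` is a list of batch sizes (row counts)."""
    return sum(per_batch_overhead + per_row_cost * n for n in batches)

many_small = [8] * 125   # 1000 rows arriving in 125 small batches
coalesced = [500, 500]   # the same 1000 rows in 2 large batches

# Identical total row work, but the small-batch plan pays the fixed
# per-batch overhead 125 times instead of twice.
```

This also matches the observation that the benefit shrinks as the configured batch size grows: with large batches, the input is effectively coalesced already.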

@ozankabak (Collaborator)

Since the benefit is marginal, let's keep this one on the backburner for a while and revisit it later.

mustafasrepo pushed a commit that referenced this pull request Jul 17, 2024
… `interval` (apache#11466)

* Unparser rule for datatime cast (#10)

* use timestamp as the identifier for date64

* rename

* implement CustomDialectBuilder

* fix

* dialect with interval style (#11)

---------

Co-authored-by: Phillip LeBlanc <[email protected]>

* fmt

* clippy

* doc

* Update datafusion/sql/src/unparser/expr.rs

Co-authored-by: Andrew Lamb <[email protected]>

* update the doc for CustomDialectBuilder

* fix doc test

---------

Co-authored-by: Phillip LeBlanc <[email protected]>
Co-authored-by: Andrew Lamb <[email protected]>