Move sum test to slt or optimizer_integration #10807

jayzhan211 · 2024-06-05T14:05:58Z

Which issue does this PR close?

Part of #10731

Rationale for this change

What changes are included in this PR?

I moved the tests to slt where that was possible and trivial. The others are moved to core/tests.

Are these changes tested?

Are there any user-facing changes?

Signed-off-by: jayzhan211 <[email protected]>

alamb · 2024-06-06T20:52:16Z

datafusion/optimizer/src/common_subexpr_eliminate.rs

@@ -861,105 +861,6 @@ mod test {
        assert_eq!(expected, formatted_plan);
    }

-    #[test]
-    fn id_array_visitor() -> Result<()> {


I think I understand why you moved these tests to the core crate (as they rely on aggregate functions that will soon not be direct dependencies of datafusion-optimizer).

However, with this change the tests for individual passes may be spread across two places (core and optimizer), so evaluating test coverage will be harder. I think we should try to keep all the tests together.

Also, I think it is quite nice that the tests are with each optimizer pass (I found it quite helpful when I was working on avoiding as many copies in the optimizer).

Options I can see:

Move all the other optimizer tests over to datafusion/core

Find a way to leave the tests in datafusion-optimizer

One idea I had for keeping the tests in datafusion-optimizer would be to add stub aggregate functions to the optimizer crate. For example, we could add a sum() AggregateUDF that had the same signature as sum but didn't actually run (it would panic if we tried to create an accumulator, etc.).

The downside is that then there is a danger that the stubs don't behave the same way as the actual functions and there are bugs / limitations when they are used together...
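To make the stub idea concrete, here is a minimal, dependency-free sketch. It does not use DataFusion's real `AggregateUDF` API: the `DataType` enum and the `AggregateFunction` and `Accumulator` traits below are hypothetical, cut-down stand-ins invented for illustration. The sketch also returns an error where the comment above suggests a panic, just to keep the example easy to check.

```rust
use std::fmt::Debug;

/// Hypothetical, simplified set of data types (not DataFusion's `DataType`).
#[derive(Debug, Clone, PartialEq)]
enum DataType {
    Int64,
    Float64,
}

/// Hypothetical stand-in for the state used to execute an aggregate.
trait Accumulator: Debug {
    fn update(&mut self, value: f64);
    fn evaluate(&self) -> f64;
}

/// Hypothetical, cut-down aggregate function interface: just enough for a
/// planner or optimizer to resolve names and types.
trait AggregateFunction: Debug {
    fn name(&self) -> &str;
    fn return_type(&self, arg_types: &[DataType]) -> Result<DataType, String>;
    fn accumulator(&self) -> Result<Box<dyn Accumulator>, String>;
}

/// Test-only stub: it plans like `sum` but can never be executed.
#[derive(Debug)]
struct SumStub;

impl AggregateFunction for SumStub {
    fn name(&self) -> &str {
        "sum"
    }

    // Assumed typing rule for this sketch: the result type matches the
    // single argument type.
    fn return_type(&self, arg_types: &[DataType]) -> Result<DataType, String> {
        match arg_types {
            [DataType::Int64] => Ok(DataType::Int64),
            [DataType::Float64] => Ok(DataType::Float64),
            other => Err(format!("sum stub: unsupported argument types {other:?}")),
        }
    }

    // The stub refuses to execute: optimizer tests only need the name and
    // the types, never an actual accumulator.
    fn accumulator(&self) -> Result<Box<dyn Accumulator>, String> {
        Err("sum stub: not executable, for optimizer tests only".to_string())
    }
}
```

A test in the optimizer crate could then build plans that reference `SumStub` and assert on the optimized plan text, while any attempt to execute the plan fails at the `accumulator()` call. The risk described above still applies: nothing in this sketch forces the stub's typing rules to stay in sync with the real function.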

for example we could add a sum() AggregateUDF that had the same signature as sum

Introducing another similar function just for tests increases the maintenance cost, unless the functions rarely change 🤔 I also prefer that the tests stay close to the code, so maybe it is an acceptable tradeoff.

Yeah, there are tradeoffs both ways.

Update: @jayzhan211 prototyped the stub approach in #10816.

jayzhan211 added 6 commits June 5, 2024 19:29

rm sum in logical plan builder

9457252

Signed-off-by: jayzhan211 <[email protected]>

rm count wildcard

5a684cf

Signed-off-by: jayzhan211 <[email protected]>

rm cse

6e58e21

Signed-off-by: jayzhan211 <[email protected]>

rm eliminate filter

edde325

Signed-off-by: jayzhan211 <[email protected]>

rm eliminate limit

7b65f29

Signed-off-by: jayzhan211 <[email protected]>

rm single distinct to groupby

8023743

Signed-off-by: jayzhan211 <[email protected]>

github-actions bot added the logical-expr (Logical plan and expressions), optimizer (Optimizer rules), core (Core DataFusion crate), and sqllogictest (SQL Logic Tests (.slt)) labels Jun 5, 2024

jayzhan211 added 4 commits June 6, 2024 08:55

mv push down filter entirely to core tests

32e9343

Signed-off-by: jayzhan211 <[email protected]>

mv id_array_visitor to own file

937f675

Signed-off-by: jayzhan211 <[email protected]>

mv sclar subquery to joins

858a38f

Signed-off-by: jayzhan211 <[email protected]>

rm sum

dbac300

Signed-off-by: jayzhan211 <[email protected]>

jayzhan211 marked this pull request as ready for review June 6, 2024 11:37

alamb reviewed Jun 6, 2024


jayzhan211 mentioned this pull request Jun 7, 2024

Remove expr_fn::sum and replace them with function stub #10816

Merged

jayzhan211 marked this pull request as draft June 7, 2024 01:04

jayzhan211 closed this in #10816 Jun 8, 2024
