Minor optimizations in Python code #13478

cclauss · 2024-07-27T16:48:00Z

% ruff check --select=C4,PERF,PIE810 --statistics | grep "\[\*\]"

Edit as requested below:

Ruff has two main command verbs: check to lint Python code and format to format code in a way that is highly compatible with psf/black. The --select option allows us to choose which of the 800+ linting rules we wish to apply to the code. Here we focus on minor optimizations. The --statistics option provides only summary information.

The grep "\[\*\]" command filters the output down to the rules that have automated fixes.
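A rough Python equivalent of that grep filter, as a sketch only. The PERF401 row is a made-up sample added for contrast and is not taken from this PR's output; `fixable_rows` and `SAMPLE_OUTPUT` are hypothetical names.

```python
# Sketch: keep only the statistics rows that ruff marks as auto-fixable ("[*]").
# SAMPLE_OUTPUT mixes two rows from the real output with one hypothetical
# non-fixable row ("[ ]") added purely for illustration.
SAMPLE_OUTPUT = """\
24\tPIE810 \t[*] multiple-starts-ends-with
13\tC408   \t[*] unnecessary-collection-call
 3\tPERF401\t[ ] manual-list-comprehension
"""


def fixable_rows(statistics_output: str) -> list[str]:
    """Return the lines that contain the literal marker "[*]"."""
    return [line for line in statistics_output.splitlines() if "[*]" in line]


for row in fixable_rows(SAMPLE_OUTPUT):
    print(row)
```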

The detailed description of each rule, including its "Why is this bad?" section, can be obtained with a command of the form ruff rule RUF017, as demonstrated below.

24	PIE810 	[*] multiple-starts-ends-with
13	C408   	[*] unnecessary-collection-call
 8	C419   	[*] unnecessary-comprehension-in-call
 5	C416   	[*] unnecessary-comprehension
 2	C400   	[*] unnecessary-generator-list
 2	C413   	[*] unnecessary-call-around-sorted
 2	PERF102	[*] incorrect-dict-iterator
 1	C401   	[*] unnecessary-generator-set
 1	C405   	[*] unnecessary-literal-set
 1	C417   	[*] unnecessary-map
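To give a feel for what the most frequent rules in this table rewrite, here is a small before/after sketch. The values `name` and `env` are hypothetical and do not come from the Meson code base; the rewrites shown are the general shape of each fix, not the exact diffs in this PR.

```python
# Sketch of what the most frequent fixes look like; `name` and `env` are
# hypothetical sample values, not code from this PR.
name = "libfoo.so"
env = {"CC": "gcc", "CXX": "g++"}

# PIE810 multiple-starts-ends-with: pass a tuple instead of chaining `or`.
before = name.endswith(".so") or name.endswith(".dylib")
after = name.endswith((".so", ".dylib"))
assert before == after

# C408 unnecessary-collection-call: prefer literals over dict()/list()/tuple().
assert dict() == {} and list() == [] and tuple() == ()

# C416 unnecessary-comprehension: a comprehension that only copies is list().
assert [k for k in env] == list(env)

# PERF102 incorrect-dict-iterator: iterate keys() when the values are unused.
keys_before = [k for k, _ in env.items()]
keys_after = [k for k in env.keys()]
assert keys_before == keys_after
```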

% ruff check --select=C4,PERF,PIE810 --fix --unsafe-fixes

Found 140 errors (58 fixed, 82 remaining).

% ruff rule RUF017

quadratic-list-summation (RUF017)

Derived from the Ruff-specific rules linter.

Fix is always available.

What it does

Checks for the use of sum() to flatten lists of lists, which has
quadratic complexity.

Why is this bad?

The use of sum() to flatten lists of lists is quadratic in the number of
lists, as sum() creates a new list for each element in the summation.

Instead, consider using another method of flattening lists to avoid
quadratic complexity. The following methods are all linear in the number of
lists:

functools.reduce(operator.iadd, lists, [])
list(itertools.chain.from_iterable(lists))
[item for sublist in lists for item in sublist]

When fixing relevant violations, Ruff defaults to the functools.reduce
form, which outperforms the other methods in microbenchmarks.

Example

lists = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
joined = sum(lists, [])

Use instead:

import functools
import operator


lists = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
functools.reduce(operator.iadd, lists, [])
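(Not part of the ruff rule output.) A quick sanity check that the three linear alternatives listed above produce the same result as sum(); the variable names are illustrative only.

```python
import functools
import itertools
import operator

lists = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]

quadratic = sum(lists, [])  # the pattern RUF017 flags
via_reduce = functools.reduce(operator.iadd, lists, [])
via_chain = list(itertools.chain.from_iterable(lists))
via_comprehension = [item for sublist in lists for item in sublist]

assert quadratic == via_reduce == via_chain == via_comprehension
# operator.iadd extends the accumulator in place, so pass a fresh [] as the
# initial value; the input sublists themselves are left untouched.
assert lists == [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
```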

eli-schwartz

I think the commit message could probably be a bit better (nothing huge, but I'd at least include in the extended description the command you used to reproduce the changes).

The other big thing to me is my comment below about the second commit.

mesonbuild/ast/interpreter.py

mesonbuild/compilers/cuda.py

eli-schwartz · 2024-07-28T02:22:55Z

mesonbuild/compilers/cuda.py

@@ -233,7 +235,7 @@ def _shield_nvcc_list_arg(cls, arg: str, listmode: bool = True) -> str:
                # There are single quotes. Double-quote them, and single-quote the
                # strings between them.
                l = [cls._shield_nvcc_list_arg(s) for s in arg.split(SQ)]
-                l = sum([[s, DQSQ] for s in l][:-1], [])  # Interleave l with DQSQs
+                l = functools.reduce(operator.iadd, [[s, DQSQ] for s in l][:-1], [])  # Interleave l with DQSQs
                return ''.join(l)


This whole logic feels kinda hairy honestly. I wonder if there's a better way to do it that doesn't need additional imports...

Hairy indeed. The commit message lists the three linear solutions and why ruff defaults to this one. Given its hairiness, let's revert this change and deal with it in a separate PR.
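For whoever picks this up in the separate PR, one import-free possibility is a nested comprehension. This is only a sketch: `DQSQ` and the contents of `l` below are hypothetical stand-ins, and the comprehension is a suggestion, not something proposed in this review.

```python
import functools
import operator

# Hypothetical stand-ins; the real DQSQ constant and the shielded strings
# live in mesonbuild/compilers/cuda.py and are not reproduced here.
DQSQ = "<DQSQ>"
l = ["a", "b", "c"]

with_sum = sum([[s, DQSQ] for s in l][:-1], [])
with_reduce = functools.reduce(operator.iadd, [[s, DQSQ] for s in l][:-1], [])
# Same pairs, flattened by a nested comprehension; no imports required.
with_comprehension = [x for s in l[:-1] for x in (s, DQSQ)]

assert with_sum == with_reduce == with_comprehension
print(with_comprehension)
```

Note that the [:-1] slice applies to the list of pairs, so all three forms leave out the final element of l. The sketch deliberately preserves that behavior rather than changing it.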

run_meson_command_tests.py

test cases/common/271 env in generator.process/generate_main.py

test cases/unit/101 relative find program/foo.py

mesonbuild/arglist.py

mesonbuild/compilers/mixins/visualstudio.py

run_project_tests.py

unittests/allplatformstests.py

eli-schwartz

Again, please no "Apply suggestions from code review" commits or other forms of fixup for a previous commit in the same PR.

cclauss · 2024-07-30T16:38:21Z

Can someone please stop and rerun the runaway GitHub Action? Five hours is a bit too long!

eli-schwartz · 2024-07-30T16:45:12Z

$ git --no-pager range-diff origin/master 3f998e47b 6709c1800
1:  3f998e47b ! 1:  6709c1800 minor optimizations
    @@ Metadata
      ## Commit message ##
         minor optimizations
     
    - ## ci/ciimage/build.py ##
    -@@ ci/ciimage/build.py: class ImageDef:
    -         data = json.loads(path.read_text(encoding='utf-8'))
    - 
    -         assert isinstance(data, dict)
    --        assert all([x in data for x in ['base_image', 'env']])
    -+        assert all(x in data for x in ['base_image', 'env'])
    -         assert isinstance(data['base_image'], str)
    -         assert isinstance(data['env'],  dict)
    - 
    -
      ## docs/jsonvalidator.py ##
     @@ docs/jsonvalidator.py: root: dict

Any particular reason for this? Just curious...

dcbaker

The changes here look good to me. Once Eli is happy with everything I'm fine to merge this.

cclauss · 2024-07-30T18:23:48Z

I reverted ci/ciimage/build.py because the three build_images jobs were failing. I could not see how this modification would cause tests to fail, but out of an abundance of caution…

eli-schwartz · 2024-07-30T18:41:40Z

They fail in git master. Those jobs are only run on a weekly schedule or whenever the job definition itself changes, so avoiding a change to the job definition makes no difference to the test other than preventing its status from showing up.

It is perfectly fine to include that change...
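For context on the ci/ciimage/build.py change itself: it is the C419 pattern, where the list passed to all() is replaced by a generator expression. The result is the same, but the generator form lets all() stop early. A minimal sketch, using a hypothetical `data` dict and a hypothetical helper `present` that records which keys were checked:

```python
# Sketch with a hypothetical `data` dict and a helper that records which
# keys were actually checked.
data = {"env": {}}
checked = []


def present(key: str) -> bool:
    checked.append(key)
    return key in data


# List form (flagged by C419): every element is evaluated before all() runs.
assert not all([present(k) for k in ["base_image", "env"]])
assert checked == ["base_image", "env"]

checked.clear()

# Generator form: all() stops at the first falsy result.
assert not all(present(k) for k in ["base_image", "env"])
assert checked == ["base_image"]
```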

cclauss requested review from mensinda, dcbaker and jpakkane as code owners July 27, 2024 16:48

eli-schwartz requested changes Jul 28, 2024

cclauss force-pushed the minor-optimizations branch from 5d6ddb6 to aa6e68c Compare July 28, 2024 08:41

cclauss requested a review from eli-schwartz July 28, 2024 09:10

cclauss force-pushed the minor-optimizations branch from aa6e68c to 6a2efc6 Compare July 29, 2024 07:58

eli-schwartz reviewed Jul 29, 2024

cclauss force-pushed the minor-optimizations branch 2 times, most recently from 039d1ec to d754adf Compare July 29, 2024 17:19

dcbaker requested changes Jul 29, 2024

eli-schwartz requested changes Jul 29, 2024

cclauss force-pushed the minor-optimizations branch from 6a600f1 to 3f998e4 Compare July 29, 2024 23:49

cclauss mentioned this pull request Jul 30, 2024

Simplify Python code with some ruff rules SIM #13485

Closed

cclauss requested review from dcbaker and eli-schwartz July 30, 2024 09:55

cclauss force-pushed the minor-optimizations branch from 3f998e4 to 6709c18 Compare July 30, 2024 11:36

dcbaker approved these changes Jul 30, 2024

minor optimizations

26eab9a

cclauss force-pushed the minor-optimizations branch from 6709c18 to 26eab9a Compare July 30, 2024 18:53

cclauss closed this Aug 16, 2024

cclauss deleted the minor-optimizations branch August 16, 2024 09:49
