inflate: make use of `enable-dfa-jump-thread` #257

folkertdev · 2024-12-03T15:42:54Z

Refactor so that the llvm enable-dfa-jump-thread has an effect. The numbers are really good for the small chunk sizes

We're now on-par for a chunk size of 4 with zlib-ng, and doing very well overall.

It really is a massive jump for chunk sizes 4 and 5, (20% and 12% resp.) and then matters less and less for bigger chunk sizes.

NOTE: these benchmarks are run with -Cllvm-args=-enable-dfa-jump-thread; this commit does not enable that flag in any way, it (for now) has to be enabled manually via rustflags.

codecov · 2024-12-03T15:44:42Z

Codecov Report

Attention: Patch coverage is 90.03984% with 25 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
zlib-rs/src/inflate.rs	90.03%	25 Missing ⚠️

Files with missing lines	Coverage Δ
zlib-rs/src/inflate.rs	`91.23% <90.03%> (-3.92%)`	⬇️

... and 1 file with indirect coverage changes

bjorn3 · 2024-12-03T15:48:07Z

zlib-rs/src/inflate.rs

+    //      not the entirity of `dispatch`. We get a massive boost from that pass.
+    //
+    // It unfortunately does duplicate the code for some of the states; deduplicating it by having
+    // more of the states call this function is slower.


Is it possible to remove the duplication by using macros for the content of the states that are currently duplicated? It would make rust-analyzer work less well on that code though.

I don't think it is because of the labels that we break/continue too. Also it's fine because we won't touch this much ever again hopefully. But not ideal for sure.

You can pass the labels as macro arguments, right?

can you? what would be the fragment specifier of the label? is it an identifier somehow? tt might work but often requires brackets

also in our function we load the reader and writer to the stack. you could parameterize the macro on that too but, idk, is that worth it?

At the very least please add a comment to every copy of every state that is duplicated indicating that they should be kept in sync.

zlib-rs/src/inflate.rs

…own function

bjorn3 · 2024-12-04T08:03:35Z

Does this regress performance when not enabling the LLVM flag or is it perf neutral?

folkertdev · 2024-12-04T09:26:02Z

it's a win even without the flag; loading the values to the stack in this restricted case is advantageous. I get a ~10% increase at chunk size 4 (versus ~20% with the flag)

folkertdev requested a review from bjorn3 December 3, 2024 15:42

bjorn3 reviewed Dec 3, 2024

View reviewed changes

folkertdev force-pushed the llvm-dfa branch from b678c0b to 5f6179f Compare December 3, 2024 15:48

bjorn3 reviewed Dec 3, 2024

View reviewed changes

zlib-rs/src/inflate.rs Show resolved Hide resolved

inflate: move the handling of the len and subsequent states into its …

3808061

…own function

folkertdev force-pushed the llvm-dfa branch from 5f6179f to 3808061 Compare December 3, 2024 18:35

bjorn3 approved these changes Dec 4, 2024

View reviewed changes

bjorn3 merged commit 7961e8e into main Dec 4, 2024
20 checks passed

bjorn3 deleted the llvm-dfa branch December 4, 2024 09:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

inflate: make use of `enable-dfa-jump-thread` #257

inflate: make use of `enable-dfa-jump-thread` #257

folkertdev commented Dec 3, 2024

codecov bot commented Dec 3, 2024 •

edited

Loading

bjorn3 Dec 3, 2024

folkertdev Dec 3, 2024

bjorn3 Dec 3, 2024

folkertdev Dec 3, 2024

folkertdev Dec 3, 2024

bjorn3 Dec 3, 2024

bjorn3 commented Dec 4, 2024

folkertdev commented Dec 4, 2024

inflate: make use of enable-dfa-jump-thread #257

inflate: make use of enable-dfa-jump-thread #257

Conversation

folkertdev commented Dec 3, 2024

codecov bot commented Dec 3, 2024 • edited Loading

Codecov Report

bjorn3 Dec 3, 2024

Choose a reason for hiding this comment

folkertdev Dec 3, 2024

Choose a reason for hiding this comment

bjorn3 Dec 3, 2024

Choose a reason for hiding this comment

folkertdev Dec 3, 2024

Choose a reason for hiding this comment

folkertdev Dec 3, 2024

Choose a reason for hiding this comment

bjorn3 Dec 3, 2024

Choose a reason for hiding this comment

bjorn3 commented Dec 4, 2024

folkertdev commented Dec 4, 2024

inflate: make use of `enable-dfa-jump-thread` #257

inflate: make use of `enable-dfa-jump-thread` #257

codecov bot commented Dec 3, 2024 •

edited

Loading