Add `#[loop_match]` for improved DFA codegen #138780

folkertdev · 2025-03-21T11:40:03Z

tracking issue: #132306
project goal: rust-lang/rust-project-goals#258

This PR adds the #[loop_match] attribute, which aims to improve code generation for state machines. For some (very exciting) benchmarks, see rust-lang/rust-project-goals#258 (comment)

Currently, a very restricted syntax pattern is accepted. We'd like to get feedback and merge this now before we go too far in a direction that others have concerns with.

current state

We accept code that looks like this

#[loop_match]
loop {
    state = 'blk: {
        match state {
            State::A => {
                #[const_continue]
                break 'blk State::B
            }
            State::B => { /* ... */ }
            /* ... */
        }
    }
}

a loop should have the same semantics with and without #[loop_match]: normal continue and break continue to work
#[const_continue] is only allowed in loops annotated with #[loop_match]`
the loop body needs to have this particular shape (a single assignment to the match scrutinee, with the body a labelled block containing just a match)

future work

perform const evaluation on the break value
support more state/scrutinee types
report proper errors when the jump target could not be determined statically
rework the pattern matching logic so hopefully it can use more existing code

maybe future work

allow continue 'label value syntax, which #[const_continue] could then use.
allow the match to be on an arbitrary expression (e.g. State::Initial)
attempt to also optimize break/continue expressions that are not marked with #[const_continue]

r? @traviscross

rustbot · 2025-03-21T11:40:10Z

Some changes occurred in match checking

cc @Nadrieril

Some changes occurred in compiler/rustc_passes/src/check_attr.rs

cc @jdonszelmann

Some changes occurred in rustc_ty_utils::consts.rs

cc @BoxyUwU

traviscross

Thanks @folkertdev for putting up this PR. The big picture looks right, in terms of the behavior of the tests and how to approach the experiment in terms of starting with the attributes for thiis.

This is a first partial pass on the details.

@rustbot author

compiler/rustc_feature/src/unstable.rs

compiler/rustc_mir_build/messages.ftl

tests/ui/loop-match/break-to-block.rs

compiler/rustc_mir_build/src/builder/expr/into.rs

compiler/rustc_mir_build/src/builder/scope.rs

compiler/rustc_passes/messages.ftl

compiler/rustc_middle/src/thir.rs

folkertdev

Thanks for the detailed review!

I've fixed a bunch of the low-hanging fruit (e.g. in the tests). For the actual pattern matching logic, I have a branch with what I believe is a better solution that re-uses more existing pattern matching infra. We'll come back to that here once björn has had a chance to look at it.

compiler/rustc_mir_build/src/builder/expr/into.rs

compiler/rustc_middle/src/thir.rs

rustbot · 2025-03-24T10:46:56Z

Some changes occurred in exhaustiveness checking

cc @Nadrieril

Some changes occurred in match lowering

cc @Nadrieril

compiler/rustc_mir_build/src/builder/expr/stmt.rs

bors · 2025-03-26T15:19:18Z

☔ The latest upstream changes (presumably #138974) made this pull request unmergeable. Please resolve the merge conflicts.

compiler/rustc_mir_build/src/builder/scope.rs

Co-authored-by: Folkert de Vries <[email protected]>

Co-authored-by: Travis Cross <[email protected]>

folkertdev · 2025-04-04T13:12:50Z

We've done a bunch of work here, and I believe all of the earlier review comments have now been dealt with.

@rustbot ready

compiler/rustc_feature/src/builtin_attrs.rs

compiler/rustc_feature/src/unstable.rs

compiler/rustc_middle/src/thir.rs

traviscross · 2025-04-06T00:27:15Z

compiler/rustc_middle/src/thir/visit.rs

+        LoopMatch { state, ref arms, .. } => {
+            visitor.visit_expr(&visitor.thir()[state]);
+            for &arm in &**arms {
+                visitor.visit_arm(&visitor.thir()[arm]);
+            }
+        }


Let's combine this arm with the one for Match below.

compiler/rustc_mir_build/src/builder/matches/mod.rs

compiler/rustc_mir_build/src/builder/expr/into.rs

compiler/rustc_mir_build/src/builder/matches/mod.rs

traviscross · 2025-04-06T02:47:24Z

compiler/rustc_mir_build/src/builder/matches/mod.rs

+        }
+    }
+
+    fn static_pattern_match_help(


Maybe there's some better name for this?

Yes please. And documentation. I understand this checks whether constant matches pat?

compiler/rustc_mir_build/src/builder/scope.rs

traviscross · 2025-04-06T03:01:58Z

@rustbot author

As a lang matter, this is looking reasonable to me in terms of a lang experiment.

As an impl matter, this is starting to look not unreasonable to me, but I'd like for @Nadrieril to also have a look if he's able.

r? @Nadrieril

@Nadrieril: I still need to raise this in a lang meeting to confirm that everyone is happy to see the experiment here in light of earlier objections, so please don't merge this just yet. You can leave it back in my hands after you're happy with the impl.

Also CC @oli-obk as this work is carrying over some FIXME items you have marked.

rustbot · 2025-04-06T03:02:02Z

Reminder, once the PR becomes ready for a review, use @rustbot ready.

Nadrieril · 2025-04-06T10:26:42Z

Only speaking of the MIR lowering part: my opinion on the current implementation is that this is a fine approach for an experiment, but this will need to change in depth before it can be relied on.

For one, I believe the static pattern-matching should use the const-eval interpreter instead of manually operating on valtrees. For two, it should not duplicate the work of match lowering; instead, BuiltMatchTree should track the tests cases that lead to each branch so that we can reuse them. This is the only way or-patterns can be supported properly.

Haven't reviewed the rest, would appreciate any help there, otherwise I'll get to it once approved.

Nadrieril · 2025-04-06T17:14:40Z

In terms of the experiment, my current take is that using patterns for this doesn't pull its weight. I expect we won't allow guaranteed-direct-jump using a non-fully-const value like:

#[loop_match]
loop {
    state = 'blk: {
        match state {
            None => {
                let r = random();
                const continue 'blk Some(r);
            },
            Some(_) => break,
        }
    };
}

The reason being that this requires inspecting the expression which is a weird sort of abstraction break (you wouldn't be able to do let x = Some(r); const continue 'blk x;).

And without that, using patterns seems barely better than jump labels.

Co-authored-by: Travis Cross <[email protected]>

traviscross · 2025-04-06T23:09:55Z

Thanks for having a look at the MIR lowering. That was indeed what I most wanted your eyes on.

For one, I believe the static pattern-matching should use the const-eval interpreter instead of manually operating on valtrees. For two, it should not duplicate the work of match lowering; instead, BuiltMatchTree should track the tests cases that lead to each branch so that we can reuse them. This is the only way or-patterns can be supported properly.

@folkertdev: What are your thoughts on this and on how and when you want to approach it?

In terms of the experiment... I expect we won't allow guaranteed-direct-jump using a non-fully-const value... The reason being that this requires inspecting the expression which is a weird sort of abstraction break (you wouldn't be able to do let x = Some(r); const continue 'blk x;).

The current implementation only supports integers and enums without fields as the scrutinee/state. Clearly we should never accept the abstraction break that you mention.

And without that, using patterns seems barely better than jump labels.

Whether or not we use patterns, what seems fairly elegant to me about this approach is that it keeps the jump labels in the value space which means that const computations can be performed to choose the jump target and that arbitrary other computations can be performed to choose the target when needed by leaving off the #[const_continue] (and accepting the codegen implications of that) without having to switch the entire block to a different syntax or engage in other workarounds.

Of course, I can imagine ways we could keep the jump labels in the value space without using patterns at all (rather than using them in restricted form by restricting the scrutinee type), and it'd be interesting to think through the pros and cons of that.

bjorn3 · 2025-04-07T08:00:29Z

In terms of the experiment... I expect we won't allow guaranteed-direct-jump using a non-fully-const value... The reason being that this requires inspecting the expression which is a weird sort of abstraction break (you wouldn't be able to do let x = Some(r); const continue 'blk x;).

The current implementation only supports integers and enums without fields as the scrutinee/state. Clearly we should never accept the abstraction break that you mention.

The argument of the const continue is a const value. Either a literal const {} block or for convenience directly an integer or enum literal.

For one, I believe the static pattern-matching should use the const-eval interpreter instead of manually operating on valtrees. For two, it should not duplicate the work of match lowering; instead, BuiltMatchTree should track the tests cases that lead to each branch so that we can reuse them. This is the only way or-patterns can be supported properly.

Const eval needs a fully built MIR body, but we are currently building a MIR body, so there is nothing const eval can run on. As for BuiltMatchTree, that is already used and static_pattern_match handles or patterns already. Everything is just intentionally limited to integers, bools and fieldless enums to make the implementation easier.

rustbot assigned traviscross Mar 21, 2025

rustbot added A-attributes Area: Attributes (`#[…]`, `#![…]`) S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Mar 21, 2025

traviscross reviewed Mar 22, 2025

View reviewed changes

rustbot added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Mar 22, 2025

This comment has been minimized.

Sign in to view

folkertdev commented Mar 22, 2025

View reviewed changes

compiler/rustc_mir_build/src/builder/expr/into.rs Outdated Show resolved Hide resolved

compiler/rustc_mir_build/src/builder/expr/into.rs Outdated Show resolved Hide resolved

compiler/rustc_middle/src/thir.rs Outdated Show resolved Hide resolved

folkertdev commented Mar 24, 2025

View reviewed changes

compiler/rustc_mir_build/src/builder/expr/stmt.rs Outdated Show resolved Hide resolved

folkertdev force-pushed the loop_match_attr branch from 368f722 to a89dcbe Compare March 27, 2025 09:39

This comment has been minimized.

Sign in to view

folkertdev commented Apr 1, 2025

View reviewed changes

compiler/rustc_mir_build/src/builder/scope.rs Outdated Show resolved Hide resolved

This comment has been minimized.

Sign in to view

folkertdev force-pushed the loop_match_attr branch from f294773 to 6fe6909 Compare April 1, 2025 16:53

This comment has been minimized.

Sign in to view

bjorn3 and others added 11 commits April 3, 2025 17:04

Add #[loop_match] for improved DFA codegen

72d0703

Co-authored-by: Folkert de Vries <[email protected]>

Apply suggestions from code review

3c7722d

Co-authored-by: Travis Cross <[email protected]>

add comments to tests

eb4ed17

remove some comments that are now inaccurate

f777efa

static pattern matching when lowering #[const_continue]

1b93888

clarify an unreachable branch

5e73b2d

add error for unknown jump target

b8c5752

emit an error when a match arm has a guard

044acdf

store the span of the match expression, and use it for diagnostics

fc982a2

add comments

5eb72b7

use Size::truncate

3716bed

folkertdev and others added 9 commits April 3, 2025 17:04

fix docs for ConstContinuableScope

271b4e0

[WIP] Support const blocks in const_continue

70db07c

Handle plain enum values

c3a1340

refactor valtree logic

b94a8e2

error on unsupported state type

fbc9f9b

remove an unwrap

e99d54a

use span_bug instead of todo

ba2f37b

simplify logic for when a #[loop_match] state type is valid

157cc9f

fix rustc updating under our feet

7d88da4

folkertdev force-pushed the loop_match_attr branch from b3a87ed to 7d88da4 Compare April 4, 2025 08:22

support boolean and character patterns

332e511

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Apr 4, 2025

traviscross mentioned this pull request Apr 6, 2025

Tracking issue for way to express intraprocedural finite state machines #132306

Open

7 tasks

traviscross reviewed Apr 6, 2025

View reviewed changes

rustbot added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Apr 6, 2025

rustbot assigned Nadrieril and unassigned traviscross Apr 6, 2025

traviscross self-assigned this Apr 6, 2025

traviscross added the I-lang-nominated Nominated for discussion during a lang team meeting. label Apr 6, 2025

Apply suggestions from code review

043be33

Co-authored-by: Travis Cross <[email protected]>

folkertdev force-pushed the loop_match_attr branch from e92034b to 043be33 Compare April 6, 2025 17:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `#[loop_match]` for improved DFA codegen #138780

Add `#[loop_match]` for improved DFA codegen #138780

folkertdev commented Mar 21, 2025 •

edited by traviscross

Loading

rustbot commented Mar 21, 2025

traviscross left a comment

This comment has been minimized.

folkertdev left a comment

rustbot commented Mar 24, 2025

bors commented Mar 26, 2025

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

folkertdev commented Apr 4, 2025

traviscross Apr 6, 2025

traviscross Apr 6, 2025

Nadrieril Apr 6, 2025

traviscross commented Apr 6, 2025 •

edited

Loading

rustbot commented Apr 6, 2025

Nadrieril commented Apr 6, 2025

Nadrieril commented Apr 6, 2025

traviscross commented Apr 6, 2025 •

edited

Loading

bjorn3 commented Apr 7, 2025

Add #[loop_match] for improved DFA codegen #138780

Are you sure you want to change the base?

Add #[loop_match] for improved DFA codegen #138780

Conversation

folkertdev commented Mar 21, 2025 • edited by traviscross Loading

current state

future work

maybe future work

rustbot commented Mar 21, 2025

traviscross left a comment

Choose a reason for hiding this comment

This comment has been minimized.

folkertdev left a comment

Choose a reason for hiding this comment

rustbot commented Mar 24, 2025

bors commented Mar 26, 2025

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

folkertdev commented Apr 4, 2025

traviscross Apr 6, 2025

Choose a reason for hiding this comment

traviscross Apr 6, 2025

Choose a reason for hiding this comment

Nadrieril Apr 6, 2025

Choose a reason for hiding this comment

traviscross commented Apr 6, 2025 • edited Loading

rustbot commented Apr 6, 2025

Nadrieril commented Apr 6, 2025

Nadrieril commented Apr 6, 2025

traviscross commented Apr 6, 2025 • edited Loading

bjorn3 commented Apr 7, 2025

Add `#[loop_match]` for improved DFA codegen #138780

Add `#[loop_match]` for improved DFA codegen #138780

folkertdev commented Mar 21, 2025 •

edited by traviscross

Loading

traviscross commented Apr 6, 2025 •

edited

Loading

traviscross commented Apr 6, 2025 •

edited

Loading