Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use the chaining methods on PartialOrd for slices too #138881

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

scottmcm
Copy link
Member

@scottmcm scottmcm commented Mar 24, 2025

#138135 added these doc-hidden trait methods to improve the tuple codegen. This PR adds more implementations and callers so that the codegen for slice (and array) comparisons also improves.

@rustbot rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Mar 24, 2025

// CHECK-LABEL: @array_of_tuple_le
#[no_mangle]
pub fn array_of_tuple_le(a: &[(i16, u16); 2], b: &[(i16, u16); 2]) -> bool {
Copy link
Member Author

@scottmcm scottmcm Mar 24, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nightly, note the setl+setg from the cmp methods:

array_of_tuple_le:
        movzwl  (%rdi), %eax
        movzwl  (%rsi), %ecx
        cmpw    %cx, %ax
        setl    %cl
        setg    %al
        jne     .LBB0_1
        movzwl  2(%rdi), %eax
        cmpw    2(%rsi), %ax
        seta    %al
        sbbb    $0, %al
        testb   %al, %al
        jne     .LBB0_7
        jmp     .LBB0_4
.LBB0_1:
        subb    %cl, %al
        testb   %al, %al
        jne     .LBB0_7
.LBB0_4:
        movzwl  4(%rdi), %eax
        movzwl  4(%rsi), %ecx
        cmpw    %cx, %ax
        setl    %cl
        setg    %al
        jne     .LBB0_5
        movzwl  6(%rdi), %eax
        cmpw    6(%rsi), %ax
        seta    %al
        sbbb    $0, %al
        jmp     .LBB0_7
.LBB0_5:
        subb    %cl, %al
.LBB0_7:
        testb   %al, %al
        setle   %al
        retq

With this PR:

array_of_tuple_le:
	movzwl	(%rcx), %eax
	movzwl	(%rdx), %r8d
	cmpw	%r8w, %ax
	jne	.LBB1_1
	movzwl	2(%rcx), %r8d
	movzwl	2(%rdx), %r9d
	cmpw	%r9w, %r8w
	jne	.LBB1_2
	movzwl	4(%rcx), %eax
	movzwl	4(%rdx), %r8d
	cmpw	%r8w, %ax
	jne	.LBB1_1
	movzwl	6(%rcx), %r8d
	movzwl	6(%rdx), %r9d
	movb	$1, %al
	cmpw	%r9w, %r8w
	jne	.LBB1_2
	retq
.LBB1_1:
	cmpw	%r8w, %ax
	setle	%al
	retq
.LBB1_2:
	cmpw	%r9w, %r8w
	setb	%al
	retq

That's so nice -- straight line with no jumps taken if they're equal, otherwise jumps to the check it needs (dedup'd by consistent register choices), saves that, and returns.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note that even with #133984 landed, we still don't optimize away the [su]cmps in nightly today: https://rust.godbolt.org/z/ss5E7q6PT.

Comment on lines +55 to +61
// This is certainly not the obvious way to implement these methods.
// Unfortunately, using anything that looks at the discriminant means that
// LLVM sees a check for `2` (aka `ControlFlow<bool>::Continue(())`) and
// gets very distracted by that, ending up generating extraneous code.
// This should be changed to something simpler once either LLVM is smarter,
// see <https://github.com/llvm/llvm-project/issues/132678>, or we generate
// niche discriminant checks in a way that doesn't trigger it.
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So there's a backlink: llvm/llvm-project#132678

@scottmcm
Copy link
Member Author

Oh, I'm glad this passes in LLVM18 too.

As the one with context from #138135,
r? @Mark-Simulacrum

@scottmcm scottmcm marked this pull request as ready for review March 24, 2025 23:54
@scottmcm scottmcm removed the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Mar 25, 2025
@bors
Copy link
Collaborator

bors commented Mar 26, 2025

☔ The latest upstream changes (presumably #138956) made this pull request unmergeable. Please resolve the merge conflicts.

@scottmcm scottmcm force-pushed the more-chaining-ord branch from 17ab2d3 to 877f8de Compare March 31, 2025 07:21
@rust-log-analyzer

This comment has been minimized.

@scottmcm scottmcm force-pushed the more-chaining-ord branch from 877f8de to cf991d9 Compare March 31, 2025 07:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants