Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add evoformer example #289

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open

Add evoformer example #289

wants to merge 2 commits into from

Conversation

harsh-nod
Copy link
Contributor

@harsh-nod harsh-nod commented Nov 22, 2024

This PR adds a sample evoformer kernel. To get this to be functional, the following changes were needed

  1. Enabling multiple batch dimensions by delinearizing workgroup_dim_2
  2. Modifications to thread shape analysis to allow edge cases

@harsh-nod harsh-nod force-pushed the evoformer branch 3 times, most recently from 165c51c to da1931f Compare November 22, 2024 02:48
Comment on lines +13 to +14
WORKGROUP_3 = index_symbol("$WG3")
WORKGROUP_4 = index_symbol("$WG4")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Doesn't the GPU typically only have 3 dims? 😆

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

They do :) This is just to keep it easier for the kernel author. All workgroup dimensions >= 2 are obtained by delinearizing workgroup id z.

@harsh-nod harsh-nod force-pushed the evoformer branch 5 times, most recently from 7aee493 to af5e9e8 Compare November 22, 2024 21:59
@harsh-nod harsh-nod force-pushed the evoformer branch 2 times, most recently from 261892b to 6c8f6c7 Compare November 25, 2024 04:55
Signed-off-by: Harsh Menon <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants