Revert "Optimize fp8 linalg_ext.attention by rework Q@K scaling" …
#5
Job | Run time |
---|---|
 | 7s |
 | 1s |
 | 8s |