Optimize fp8 linalg_ext.attention by rework Q@K scaling (#18031) #222
| Job | Run time |
|---|---|
| | 0s |