Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[LLVMGPU] Add multi-row vector reduction configuration (#73)
This is to speed up matvec. The new configuration is experimental and only applied on ROCm targets.
- Loading branch information