You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This effort should be undertaken only if linking to the optimized library versions investigated issue #66 does not reduce runtimes.
Most calls to matmul() in the computationally intensive kernels are for small matrices. These may not benefit from the compiler's implementation of matmul or linking to and calling BLAS or other library versions.
Write simple & possibly naive inline matmul() using loops
Experiment with loop optimization strategies (unrolling, fusion, inversion, etc.) & index ordering
The text was updated successfully, but these errors were encountered:
This effort should be undertaken only if linking to the optimized library versions investigated issue #66 does not reduce runtimes.
Most calls to
matmul()
in the computationally intensive kernels are for small matrices. These may not benefit from the compiler's implementation ofmatmul
or linking to and calling BLAS or other library versions.matmul()
using loopsThe text was updated successfully, but these errors were encountered: