You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What is your question?
I test the example 06_splitK_gemm in 2080ti gpu,I want to know how does the threadblock read the global memory,I debug into the cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h file,I find the typename Mma::IteratorA iterator_A and the mma(gemm_k_iterations,accumulators,iterator_A,iterator_B,accumulators),but I can't find the mma‘s code implementation .
where can I find the mma code implementation ?
The text was updated successfully, but these errors were encountered:
What is your question?
I test the example 06_splitK_gemm in 2080ti gpu,I want to know how does the threadblock read the global memory,I debug into the cutlass/include/cutlass/gemm/kernel/gemm_splitk_parallel.h file,I find the typename Mma::IteratorA iterator_A and the mma(gemm_k_iterations,accumulators,iterator_A,iterator_B,accumulators),but I can't find the mma‘s code implementation .
where can I find the mma code implementation ?
The text was updated successfully, but these errors were encountered: