Skip to content

Commit

Permalink
Correct mma_sp_m16n8k16 implementation
Browse files Browse the repository at this point in the history
  • Loading branch information
eschnett committed Jan 16, 2024
1 parent 119f40b commit a189cf4
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/IndexSpaces.jl
Original file line number Diff line number Diff line change
Expand Up @@ -2010,7 +2010,7 @@ CUDA.@device_override function mma_sp_m16n8k16(
C[1].val % Int32,
C[2].val % Int32,
e.val % Int32,
F % Int32,
f % Int32,
)
return (Float16x2(D[1] % UInt32), Float16x2(D[2] % UInt32))
end
Expand Down

0 comments on commit a189cf4

Please sign in to comment.