CUTLASS is integrated into TVM #350

hwu36 · 2021-10-29T02:16:02Z

hwu36
Oct 29, 2021
Maintainer

apache/tvm@541f9f2

This first commit adds Turing and Ampere FP16 tensor core GEMMs to TVM.

Great work and thank you very much, TVM community!!!

hwu36 · 2021-11-07T02:19:19Z

hwu36
Nov 7, 2021
Maintainer Author

CUTLASS batched GEMM is now integrated, too.


masahi · 2021-12-15T19:10:04Z

masahi
Dec 15, 2021

Convolution kernels are also integrated. Results and analysis on end-to-end models are available at apache/tvm#9746.

It looks like TVM + CUTLASS is faster than TVM + cuDNN. I'm not claiming that "CUTLASS is faster than cuDNN on my GPU"; it's just that the way we use cuDNN is apparently not performing as well as it should.

