You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What is your question?
Hi, I want to define a new convolution2d kernel Fprop based on the bare convolution kernel.
My idea is to modify the weights or filters before the convolution operation.
I would need a little help to know or understand how Cutlass allows the definition of a new custom kernel and how it is executed later in the GPU (my GPU is a Tesla v100).
I am a little confused by looking at examples like the number 9 on how I must perform the modification of the weights or filters because it seems only to execute templates to other templates like default convolution but no operation is performed.
Any help or link to a clarifier example on how to program my kernel and later use it or a base explanation would be appreciated.
Thank you.
The text was updated successfully, but these errors were encountered:
When you refer to main loop, which code segment do you mean? In the .cu of the example, no loop performs the convolution; it is only in the .h file line 309. Is this?
What is your question?
Hi, I want to define a new convolution2d kernel Fprop based on the bare convolution kernel.
My idea is to modify the weights or filters before the convolution operation.
I would need a little help to know or understand how Cutlass allows the definition of a new custom kernel and how it is executed later in the GPU (my GPU is a Tesla v100).
I am a little confused by looking at examples like the number 9 on how I must perform the modification of the weights or filters because it seems only to execute templates to other templates like default convolution but no operation is performed.
Any help or link to a clarifier example on how to program my kernel and later use it or a base explanation would be appreciated.
Thank you.
The text was updated successfully, but these errors were encountered: