-
Hello Thanks |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 1 reply
-
If you can do fusion in the epilogue, do it because it is usually free. Mainloop fusion inevitably hurts the performance. Example 25 is for the case that you cannot do epilogue fusion for whatever reason. |
Beta Was this translation helpful? Give feedback.
-
you can preload into the shared memory if you want. writing iterators or not are both okay. you can just write a simple load function. |
Beta Was this translation helpful? Give feedback.
you can preload into the shared memory if you want. writing iterators or not are both okay. you can just write a simple load function.