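Tiling improves performance by staging data in the device's shared memory, so the threads of a block reuse each loaded value instead of re-reading it from slower global memory.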
https://www.youtube.com/watch?v=tGu5DyIlofY
http://www.umiacs.umd.edu/~ramani/cmsc828e_gpusci/Lecture5.pdf
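As an illustration of the idea, here is a minimal sketch of a tiled matrix multiply in CUDA. It is not taken from the links above. The names matMulTiled and matMulTiledHost, the tile width of 16, and the restriction to square row-major matrices are all assumptions made for the example.

```cuda
#include <cuda_runtime.h>

#define TILE 16  // assumed tile width; tune for the target device

// Computes C = A * B for square n x n row-major matrices (hypothetical kernel name).
__global__ void matMulTiled(const float* A, const float* B, float* C, int n) {
    // One tile of A and one tile of B live in shared memory per block.
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;
    float acc = 0.0f;

    int numTiles = (n + TILE - 1) / TILE;
    for (int t = 0; t < numTiles; ++t) {
        int aCol = t * TILE + threadIdx.x;
        int bRow = t * TILE + threadIdx.y;
        // Each thread loads one element per tile; out-of-range entries are zero-padded.
        As[threadIdx.y][threadIdx.x] = (row < n && aCol < n) ? A[row * n + aCol] : 0.0f;
        Bs[threadIdx.y][threadIdx.x] = (bRow < n && col < n) ? B[bRow * n + col] : 0.0f;
        __syncthreads();  // the whole tile must be loaded before anyone reads it

        // Every value in the tile is reused by TILE threads from shared memory.
        for (int k = 0; k < TILE; ++k)
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();  // finish reading before the next iteration overwrites the tile
    }
    if (row < n && col < n) C[row * n + col] = acc;
}

// Hypothetical host wrapper: copies inputs to the device, launches the kernel,
// and copies the result back. Allocation and input-copy errors are not checked
// individually, to keep the sketch short.
cudaError_t matMulTiledHost(const float* hA, const float* hB, float* hC, int n) {
    size_t bytes = (size_t)n * n * sizeof(float);
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    cudaMalloc(&dA, bytes);
    cudaMalloc(&dB, bytes);
    cudaMalloc(&dC, bytes);
    cudaMemcpy(dA, hA, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dB, hB, bytes, cudaMemcpyHostToDevice);

    dim3 block(TILE, TILE);
    dim3 grid((n + TILE - 1) / TILE, (n + TILE - 1) / TILE);
    matMulTiled<<<grid, block>>>(dA, dB, dC, n);

    cudaError_t err = cudaGetLastError();
    if (err == cudaSuccess)
        err = cudaMemcpy(hC, dC, bytes, cudaMemcpyDeviceToHost);
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
    return err;
}
```

Without tiling, each thread would read a full row of A and a full column of B from global memory. With tiling, a block loads each tile once and every element is then read TILE times from shared memory, which cuts global memory traffic by roughly a factor of TILE.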