-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Combine similar kernels using cooperative groups #97
Conversation
Here we use Euler mortar 3D as an example for benchmarking.
|
Here we use Euler mortar 3D as an example for benchmarking.
|
Here we use MHD mortar 3D as an example for benchmarking.
|
Several kernels are launched separately to achieve grid-wide synchronization. Cooperative groups can help combine these similar kernels into a single kernel launch to improve performance. Also, kernel size configurators for 1D, 2D, and 3D cooperative kernel launches are added.
Tasks:
cuda_prolong2mortars!
for 3Dcuda_mortar_flux!
withnonconservative_terms::False
for 3Dcuda_mortar_flux!
withnonconservative_terms::False
for 3D