Grid groups and grid sync were added in Numba 0.53.1. A short section on using these to implement a global barrier would be good, perhaps based around the example kernel from the documentation:
from numba import cuda, int32, void

@cuda.jit(void(int32[:,::1]))
def sequential_rows(M):
    col = cuda.grid(1)
    g = cuda.cg.this_grid()

    rows = M.shape[0]
    cols = M.shape[1]

    for row in range(1, rows):
        opposite = cols - col - 1
        # Each row's elements are one greater than the previous row
        M[row, col] = M[row - 1, opposite] + 1
        # Wait until all threads have written their column element,
        # and that the write is visible to all other threads
        g.sync()
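To make the role of the barrier concrete, here is a CPU-only analogy that does not use Numba or a GPU. It is a minimal sketch, assuming one Python thread per column and using `threading.Barrier` in place of `g.sync()`; the function name `sequential_rows_cpu` and the 4x8 input are hypothetical and chosen only for illustration. Each thread writes its element of the current row and then waits, so no thread reads row `row` until every column of that row has been written.

```python
import threading

def sequential_rows_cpu(M):
    """CPU analogy of the kernel: one thread per column, M is a list of lists."""
    rows = len(M)
    cols = len(M[0])
    # Stand-in for the grid-wide barrier: every "column thread" must arrive
    # before any of them proceeds to the next row.
    barrier = threading.Barrier(cols)

    def worker(col):
        for row in range(1, rows):
            opposite = cols - col - 1
            # Reads an element of the previous row written by another thread
            M[row][col] = M[row - 1][opposite] + 1
            # Plays the role of g.sync() in the kernel
            barrier.wait()

    threads = [threading.Thread(target=worker, args=(c,)) for c in range(cols)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return M

# Hypothetical input: first row holds 0..7, remaining rows are filled in
M = [[0] * 8 for _ in range(4)]
M[0] = list(range(8))
result = sequential_rows_cpu(M)
```

Without the `barrier.wait()` call, a thread could start on the next row before its opposite column had been written, which is the same hazard `g.sync()` prevents across the whole grid in the kernel.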