You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
Add
map_location='cuda'
. You can set'cpu'
instead for CPU only modeSIMPLE-NN_v2/simple_nn_v2/models/data_handler.py
Line 60 in 28007d6
Remove all
.to(device)
in training codeSet
pin_memory=False
inSIMPLE-NN_v2/simple_nn_v2/models/data_handler.py
Line 223 in 28007d6
pin_memory
only works CPU->GPU data transport.Add
.to(device)
for scale and pca (Since all data is uploaded in the target device and collate_fn also works on that device).Add
.to(device)
for all tensors generated in the code.In my case, training speed increase by 5~10 times.
Beta Was this translation helpful? Give feedback.
All reactions