Hi @QingZhuanya, thanks for your question. I think this issue is caused by data loading and is related to the number of workers. Try setting a different number of workers in your config to see whether it helps. For instance, change the following num_workers from 16 to 4:
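The config snippet referred to above did not survive in this thread, so here is only a rough sketch of the idea. The nested `config` layout, its key names other than `num_workers`, and the helper `with_num_workers` are all hypothetical and will likely differ from the project's actual config file.

```python
import copy

# Hypothetical config layout; the real project's config keys may differ.
config = {
    "data": {
        "train": {"batch_size": 8, "num_workers": 16},
        "val": {"batch_size": 8, "num_workers": 16},
    }
}


def with_num_workers(cfg, num_workers):
    """Return a deep copy of cfg with every 'num_workers' entry replaced."""
    new_cfg = copy.deepcopy(cfg)

    def _walk(node):
        # Recurse through nested dicts and lists, overriding matching keys.
        if isinstance(node, dict):
            for key, value in node.items():
                if key == "num_workers":
                    node[key] = num_workers
                else:
                    _walk(value)
        elif isinstance(node, list):
            for item in node:
                _walk(item)

    _walk(new_cfg)
    return new_cfg


# Lower the worker count from 16 to 4, leaving the original config untouched.
debug_config = with_num_workers(config, 4)
print(debug_config["data"]["train"]["num_workers"])
```

If the project loads data through PyTorch's `DataLoader`, trying `num_workers=0` as well may help narrow things down, since that setting loads data in the main process without worker subprocesses.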
Thanks for the answer. I changed it as you suggested, but unfortunately training is still stuck. My environment has 8 A100 GPUs. Is there anything else I can try? Thank you.
Thank you for the excellent project. May I ask why training gets stuck during phase1?