-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Paper reproduction #7
Comments
Did you leverage the python environments in this repo?
|
I recommend you to use the experiment setting in this repo |
Did you run it with four graphics cards? |
I think the number of GPU cards is not important. As long as you can run the code with the setting |
How long does it take you to train an epoch? I used two 3090s and it took 3.5 hours to train an epoch. |
@xuliuwei May I chat with you about this issue? Thank you~ Wechat: margin333 |
I recently tried to reproduce the results of your paper, but I was unable to achieve the results of the paper. I trained with two 3090 students and batchsize of 12, and the learning rate is the same as that of the paper. May I ask whether the learning rate is the reason for the poor result? Does the learning rate have a big impact on your result? Can you provide your pre-training model?
The text was updated successfully, but these errors were encountered: