You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Title: Unable to Reproduce DKD Experiment Results on Tesla T4 Server Using Repository Code
Body:Dear maintainers,
I recently attempted to replicate the experiment results of Distilled Knowledge Distillation (DKD) mentioned in the paper, using your repository's code on my Tesla T4 server. Unfortunately, I was not able to achieve the same results as documented.
Could you please advise if there are any specific configurations or steps that I might have missed? Here is what I have done so far:
Set up the environment as per the documentation.
Pulled the latest code from the master branch of the repository.
Followed the instructions in the README file to set up the DKD experiment.
Ran the experiment with the default settings provided.
However, the results were significantly different from those reported in the paper. I would appreciate any guidance or recommendations to address this issue.
Thank you for your time and assistance.
Best regards,
Yintel
The text was updated successfully, but these errors were encountered:
Did you run the experiment on 8 GPUs? The batch-size on each GPU could be very small if running on 8 GPUs. The results reported for CIFAR-100 is run on only 1 GPU.
Thank You. I will try it on only one gpu. Another question is, if I want to set a large batch size for boosting training speed, learning rate should increase or decrease?
Thank You. I will try it on only one gpu. Another question is, if I want to set a large batch size for boosting training speed, learning rate should increase or decrease?
Title: Unable to Reproduce DKD Experiment Results on Tesla T4 Server Using Repository Code
Body:Dear maintainers,
I recently attempted to replicate the experiment results of Distilled Knowledge Distillation (DKD) mentioned in the paper, using your repository's code on my Tesla T4 server. Unfortunately, I was not able to achieve the same results as documented.
Could you please advise if there are any specific configurations or steps that I might have missed? Here is what I have done so far:
Set up the environment as per the documentation.
Pulled the latest code from the master branch of the repository.
Followed the instructions in the README file to set up the DKD experiment.
Ran the experiment with the default settings provided.
However, the results were significantly different from those reported in the paper. I would appreciate any guidance or recommendations to address this issue.
Thank you for your time and assistance.
Best regards,
Yintel
The text was updated successfully, but these errors were encountered: