You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to replicate the DINOv2 experiment from this table:
However, I had some questions here.
Is this student model trained with a 224x224 resolution with CPE? Does DINOv2 sees a 224x224 resolution image as well? For evaluation on ADE20k, is the input data of 512x512 resolution? And is it 224x224 for the CLIP distilled model?
What are the learning rate hyperparameters set for this experiment? Is it the same as the RADIO experiments? How many steps did you run this for?
Could you provide a little more clarity behind the Learning Rate Scheduler for the RADIO experiment? For example, the cycling aspect of CosineAnnealing scheduler.
The text was updated successfully, but these errors were encountered:
Is your goal to replicate the feature distillation experiment (the table you're pointing to), or the overall RADIO experiment?
Is this student model trained with a 224x224 resolution with CPE? Does DINOv2 sees a 224x224 resolution image as well? For evaluation on ADE20k, is the input data of 512x512 resolution? And is it 224x224 for the CLIP distilled model?
Yes. Yes. Input data is 518px (owing to patch-14 student). Yes.
What are the learning rate hyperparameters set for this experiment? Is it the same as the RADIO experiments? How many steps did you run this for?
Same as RADIO hparams. The difference in this table is which, if any, teacher models we apply feature distillation against. 600k steps.
Could you provide a little more clarity behind the Learning Rate Scheduler for the RADIO experiment? For example, the cycling aspect of CosineAnnealing scheduler.
I'm trying to replicate the DINOv2 experiment from this table:
However, I had some questions here.
The text was updated successfully, but these errors were encountered: