You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi I have a small question about the various knowledge matching strategies.
In the experiment, bottom-up strategy is way better than top-down one, and why do you think so?
In my knowledge, upper layer may predict more precise logits, which can be more beneficial to guide lower layers to mimic precise logits, so that can lead to top-down strategy surpass bottom-up.
By the way, thanks for providing such a wonderful work! Im very impressed from your research.
The text was updated successfully, but these errors were encountered:
Hi I have a small question about the various knowledge matching strategies.
In the experiment, bottom-up strategy is way better than top-down one, and why do you think so?
In my knowledge, upper layer may predict more precise logits, which can be more beneficial to guide lower layers to mimic precise logits, so that can lead to top-down strategy surpass bottom-up.
By the way, thanks for providing such a wonderful work! Im very impressed from your research.
The text was updated successfully, but these errors were encountered: