Are the model weights loaded into the ASTModel with stochasticity? #30
I found the following code in the ASTModel definition to be the source of differentiation. This leaves me wondering if the
Whenever we initialize a new layer (via Is this how it should function, or should initialization calls for |
Thanks for reporting this. This may be valid; the code has some problems loading a pretrained SSAST model.
I have a `best_audio_model.pth` folder that is the output of a pretrained SSAST model. My specific `.pth` file can be found here for reproducibility. Note that this is the result of a `tiny` model fit to 5 of the classes found in the ESC-50 data set; I chose 5 classes to keep the computation and plotting simple.

The following code (MWE) reveals that the weights are loaded differently in two model initializations, despite the parameters and data being the same. Specifically, `mlp_head.1.weight` (and potentially more) appears to be different.

Forgive me if my assumption is incorrect, but I would assume that the weights should be identical for `model_1` and `model_2`. Additionally, I would assume that `model_1(inputs, 'ft_avgtok')` and `model_2(inputs, 'ft_avgtok')` would give the same output for identical `inputs`, which they do not. Is there something simple that I'm missing here?

Once again, thank you for all of your help, Yuan. I greatly appreciate it and am sorry to keep bothering you.
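
For readers who want to reproduce this kind of check without the SSAST code base, here is a minimal sketch of how two separately constructed models can end up with different weights after loading the same checkpoint. It is not the MWE referred to above. `TinyModel`, `differing_keys`, and the backbone-only `checkpoint` are hypothetical stand-ins, and the assumption that the checkpoint omits the head is only one possible explanation, not something confirmed in this thread.

```python
import torch
import torch.nn as nn


class TinyModel(nn.Module):
    """Hypothetical stand-in for ASTModel: a backbone plus an mlp_head."""

    def __init__(self, dim=8, n_class=5):
        super().__init__()
        self.backbone = nn.Linear(dim, dim)
        # Index 1 of the Sequential yields the key name `mlp_head.1.weight`.
        self.mlp_head = nn.Sequential(nn.LayerNorm(dim), nn.Linear(dim, n_class))

    def forward(self, x):
        return self.mlp_head(self.backbone(x))


def differing_keys(model_a, model_b):
    """Return the state_dict keys whose tensors are not exactly equal."""
    sd_a, sd_b = model_a.state_dict(), model_b.state_dict()
    return [k for k in sd_a if not torch.equal(sd_a[k], sd_b[k])]


# Assumed situation: the checkpoint covers the backbone but not the head.
checkpoint = {
    k: v for k, v in TinyModel().state_dict().items() if k.startswith("backbone.")
}

model_1, model_2 = TinyModel(), TinyModel()
# With strict=False, keys absent from the checkpoint keep their fresh random init.
missing_1 = model_1.load_state_dict(checkpoint, strict=False).missing_keys
model_2.load_state_dict(checkpoint, strict=False)

diff = differing_keys(model_1, model_2)
print("missing from checkpoint:", missing_1)
print("differ between models:", diff)

# Seeding before each construction makes the random init reproducible.
torch.manual_seed(0)
seeded_1 = TinyModel()
torch.manual_seed(0)
seeded_2 = TinyModel()
print("differ when seeded:", differing_keys(seeded_1, seeded_2))
```

In this sketch the backbone tensors match because both models take them from the checkpoint, while the head's linear layer differs because each model drew it randomly. Running the same key comparison on the real `model_1` and `model_2`, and inspecting `missing_keys` where `load_state_dict` is called, would show whether something similar happens with the SSAST checkpoint.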