-
Notifications
You must be signed in to change notification settings - Fork 51
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Config/Model Checkpoint Pairing #47
Comments
Yes, you are right, if you want to infer only, no adjustments needed. |
Thanks! If I can ask, for the large model trained on speech/audio/music the provided checkpoint is WavTokenizer-large-speech-75token. It seems only speech, am I wrong? |
WavTokenizer-Large-unify-40token support speech, audio, music. WavTokenizer-large-speech-75token support speech only up to now. |
Thank you. The link here show an empty repo, do you plan to release the model? |
Greetings. I will comment here since this issue is related with the problem I am facing. Method When trying to use these config files by calling
So these config files are not suitable for loading the pretrained models for inference either or I am doing something wrong. I believe the provided examples in the main Thanks in advance |
Hi!
First of all, thank you for your excellent work! I am currently trying to use pre-trained WavTokenizer models and I wanted to confirm the correct pairing between the available configuration files and the model checkpoints before proceeding with my experiments.
Based on the README and available configuration files, I have made the following assumptions for pairing each config with its corresponding model checkpoint. Could you please confirm if this is correct?
wavtokenizer_smalldata_frame40_3s_nq1_code4096_dim512_kmeans200_attn.yaml
wavtokenizer_smalldata_frame75_3s_nq1_code4096_dim512_kmeans200_attn.yaml
Inferred Pairing:
wavtokenizer_smalldata_frame40_3s_nq1_code4096_dim512_kmeans200_attn.yaml
should be paired with the following models:WavTokenizer-small-600-24k-4096
WavTokenizer-large-600-24k-4096
wavtokenizer_smalldata_frame75_3s_nq1_code4096_dim512_kmeans200_attn.yaml
should be paired with the following models:WavTokenizer-small-320-24k-4096
WavTokenizer-medium-320-24k-4096
WavTokenizer-large-320-24k-4096
Could you please confirm if this pairing is correct or if any adjustments are needed?
Thank you for your help!
The text was updated successfully, but these errors were encountered: