-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Help Wanted] Training from scratch on 1000 hours of Spanish does not work #565
Comments
need more info, e.g. detailed configuration of training setup |
The command I run (I have made a few changes to the repo around how to pass experiment names, etc, nothing to do with the actual training) is:
I run on 8*A100 GPUs Audio sounds like this after 500k steps https://whyp.it/tracks/231435/gibberish?token=TQ1fK. When I run the exact same setup but with |
so actually 4*a100 also you may refer to #548 , as you are using a 1000hours dataset |
Thank you, changing the learning rate and increasing the number of warmup updates has helped |
hi @rlenain can you share the dataset |
Unfortunately I cannot |
puedes compartir el modelo final? estoy necesitando un buen modelo en español y el que hay disponible solo tiene 250 horas de entrenamiento |
@rlenain Is your data in LJSpeech style? Which recipe exactly did you use for your training? Can you share you training configuration? |
Checks
Environment Details
Linux, Python=3.10
Steps to Reproduce
I ran
finetune_cli.py
with--finetune False
, so training from scratch, with 1000 hours of Spanish data and even after 500k steps, I am still not getting speech out. It sounds like the original speaker from the prompt sometimes, but the words being uttered are complete gibberish.Any help on this?
✔️ Expected Behavior
I would like to get speech.
❌ Actual Behavior
Gibberish
The text was updated successfully, but these errors were encountered: