You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This template is only for question, not feature requests or bug reports.
I have thoroughly reviewed the project documentation and read the related paper(s).
I have searched for existing issues, including closed ones, no similar questions.
I confirm that I am using English to submit this report in order to facilitate communication.
Question details
I trained my dataset for about 40 hours with a single speaker using F5 for 35 epochs. My model synthesizes short words well, but when synthesizing longer text, it produces speech with a different accent. For example, Uzbek is similar to Turkish, so sometimes my model synthesizes Uzbek text with a Turkish accent. Additionally, there is some noise at the end of the output audio.
How can I resolve this issue?
The text was updated successfully, but these errors were encountered:
@Mustaphajudi can you tell me length of audio ? such as 10s or 15 s ?
and total 50 hours dataset is enough ?
RifatMamayusupov
changed the title
Can I use my own language IP for getting perfect model ?
Can I use my own language IPA for getting perfect model ?
Dec 11, 2024
Checks
Question details
I trained my dataset for about 40 hours with a single speaker using F5 for 35 epochs. My model synthesizes short words well, but when synthesizing longer text, it produces speech with a different accent. For example, Uzbek is similar to Turkish, so sometimes my model synthesizes Uzbek text with a Turkish accent. Additionally, there is some noise at the end of the output audio.
How can I resolve this issue?
The text was updated successfully, but these errors were encountered: