Traning on wenetspeech couldn‘t converge #28

dyyoungg · 2024-09-11T08:36:02Z

I compared two experimental data setups.
setting 1: WenetSpeech（Chinese）only
setting 2: Wenet + Giga （about 1：1， Chinese + English）

It's interesting that training on setting 1 can't decrease normally （blue curve in the following image）， while setting 2 mixed with English can converge normally.
Have you observed this phenomenon in your experiments?

jishengpeng · 2024-09-12T04:32:39Z

I compared two experimental data setups. setting 1: WenetSpeech（Chinese）only setting 2: Wenet + Giga （about 1：1， Chinese + English）

It's interesting that training on setting 1 can't decrease normally （blue curve in the following image）， while setting 2 mixed with English can converge normally. Have you observed this phenomenon in your experiments?

This situation is somewhat unusual. You may use a small amount of Chinese data (approximately 500 hours) to verify whether this issue always arises when the model is trained on purely Chinese data.

wntg · 2024-09-14T07:11:56Z

I‘m intersting in Chinese too. Do you have any further results?

boltzmann-Li · 2024-11-25T04:06:39Z

WenetSpeech could be too noisy, you may want to start with AIShell3, then WenetSpeech4TTS.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Traning on wenetspeech couldn‘t converge #28

Traning on wenetspeech couldn‘t converge #28

dyyoungg commented Sep 11, 2024

jishengpeng commented Sep 12, 2024

wntg commented Sep 14, 2024

boltzmann-Li commented Nov 25, 2024

Traning on wenetspeech couldn‘t converge #28

Traning on wenetspeech couldn‘t converge #28

Comments

dyyoungg commented Sep 11, 2024

jishengpeng commented Sep 12, 2024

wntg commented Sep 14, 2024

boltzmann-Li commented Nov 25, 2024