You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi
I want to measure the quality of the synthetic audio with metric that need reference audio, like pesq, mcd etc.
I followed the preprocess code of melgan (librosa.load => trim 20db => mel), i find that some trimed original audio has difference length with the synthetic audio which make the pesq computation failed. Why does generated wavforms have difference length with the original waveform (trimed)
The text was updated successfully, but these errors were encountered:
Hi
I want to measure the quality of the synthetic audio with metric that need reference audio, like pesq, mcd etc.
I followed the preprocess code of melgan (librosa.load => trim 20db => mel), i find that some trimed original audio has difference length with the synthetic audio which make the pesq computation failed. Why does generated wavforms have difference length with the original waveform (trimed)
The text was updated successfully, but these errors were encountered: