Some audio files result in NAN values in aperiodicity #50

Kal213 · 2020-06-24T22:11:40Z

When using wav2world (or DIO and D4C), I've noticed that sometimes the aperiodicity returns with nan values in it. This causes me issues when I try to synthesize the audio back.
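
A quick way to confirm the symptom before synthesis is to look for NaN entries in the returned aperiodicity. The sketch below assumes `ap` is the 2-D array (frames by frequency bins) returned by pyworld. The helper name `nan_frames` is hypothetical and not part of pyworld.

```python
import numpy as np

def nan_frames(ap):
    """Return the indices of frames (rows) whose aperiodicity contains NaN."""
    ap = np.asarray(ap)
    # A frame is flagged if any of its frequency bins is NaN.
    return np.flatnonzero(np.isnan(ap).any(axis=1))
```

An empty result means the aperiodicity is free of NaN and can be passed on to synthesis.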

I've been unable to find exactly what causes it, but I've found that taking the absolute value of the audio data before inputting it will "fix" the issue.

I've attached a short python script and an audio file that causes the problem on my build. If you have any ideas what's causing this or how to fix it please let me know!

As a final note, some audio samples with negative values work well with this. It's very odd, and maybe just an issue with WORLD itself.

example.zip

JeremyCCHsu · 2020-06-25T06:11:31Z

Hi @Kal213, thank you for reporting this issue. I tested it and found the behavior really strange.

It seems that casting the np.float32 data read by librosa.load to np.float64 with numpy's .astype(np.float64) method caused the issue.

For now, I can only suggest that you load the wav file using soundfile.read which returns np.float64 by default.
With that change, world.wav2world seems to work fine with the example you attached and does not result in NaN APs.
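
A minimal sketch of this workaround, assuming the soundfile package is installed. The helper name `load_for_world` and the mono downmix are my own additions for illustration and are not part of pyworld or soundfile.

```python
import numpy as np

def load_for_world(path):
    """Read a wav file as float64 so no float32 -> float64 cast is needed."""
    import soundfile as sf  # third-party package, imported lazily

    # soundfile.read returns (data, samplerate); dtype defaults to 'float64'.
    x, fs = sf.read(path)
    if x.ndim > 1:
        # Assumption: average the channels to get the mono signal WORLD works on.
        x = x.mean(axis=1)
    # Make sure the result is a C-contiguous float64 array.
    return np.ascontiguousarray(x, dtype=np.float64), fs
```

The returned `x` and `fs` can then be passed to `pyworld.wav2world(x, fs)`.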

If you have other solutions, or find this workaround not working, please share your findings with us. Thanks.

jerry-cj-chang · 2020-09-22T10:04:16Z

I guess it's because librosa.load reads the file and interpolates the samples to the specified sample rate.
Some of these interpolated points cannot be recast to int16, which I guess is the cause of this NaN issue.
You can fix this by casting the float32 data to int16 and then to double, or by setting the sampling rate argument in librosa.load to None so that librosa won't resample.
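
The int16 round trip could look roughly like the sketch below. The comment above does not say how the float data should be scaled, so the factor of 32767 and the clipping are assumptions. The helper name `requantize_to_float64` is hypothetical.

```python
import numpy as np

def requantize_to_float64(x):
    """Round-trip float audio in [-1, 1] through int16 and return float64."""
    x = np.asarray(x, dtype=np.float32)
    # Assumption: scale to the int16 range, round, and clip before casting.
    scaled = np.clip(np.round(x * 32767.0), -32768, 32767)
    x_int16 = scaled.astype(np.int16)
    # Cast back to double and undo the scaling.
    return x_int16.astype(np.float64) / 32767.0
```

The alternative mentioned above skips resampling entirely by calling `librosa.load(path, sr=None)`, which keeps the file's native sampling rate.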

tshmak · 2023-01-03T04:30:37Z

I also just ran into this problem, and yes, the solution I found is the same as jerry-cj-chang's: the float data has to be recast to 16-bit and then to 64-bit before passing it to WORLD. Hope there's a proper fix soon.

JeremyCCHsu mentioned this issue Jun 25, 2020

NaN sometimes introduced in coarse aperiodicity estimation mmorise/World#92

Closed
