You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for publishing your work and making your code available online. It is of great value to the audio community.
I was curious about how using more or less quantizers affects the distance between the continuous and quantized embeddings in the high-dimensional embedding space. So I produced this code:
And I was very surprised to see that the norm is increasing with i! Do you have any explanation?
I understand that the distance to code entries is computed in the 8d low-dimensional space, but the 1024d residual should still get smaller the more RVQ scales we use?
Note: I also joined the audio I used in this test and some reconstruction using different number of RVQ scales and it works well. Download link: audio_to_i.zip
The text was updated successfully, but these errors were encountered:
Dear Authors,
Thank you for publishing your work and making your code available online. It is of great value to the audio community.
I was curious about how using more or less quantizers affects the distance between the continuous and quantized embeddings in the high-dimensional embedding space. So I produced this code:
And I was very surprised to see that the norm is increasing with i! Do you have any explanation?
I understand that the distance to code entries is computed in the 8d low-dimensional space, but the 1024d residual should still get smaller the more RVQ scales we use?
Note: I also joined the audio I used in this test and some reconstruction using different number of RVQ scales and it works well. Download link: audio_to_i.zip
The text was updated successfully, but these errors were encountered: