You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
hi there, great work! I just wondering that can wavtokenizer be compared with whisper? since the Qwen-audio series uses whisper as an audio encoder, can wavtokenizer be used as an alternative, and where are its advantages and disadvantages?
thanks
The text was updated successfully, but these errors were encountered:
isruihu
changed the title
Comparison with Whipser
Comparison with Whisper
Sep 11, 2024
hi there, great work! I just wondering that can wavtokenizer be compared with whisper? since the Qwen-audio series uses whisper as an audio encoder, can wavtokenizer be used as an alternative, and where are its advantages and disadvantages?
thanks
The WavTokenizer can be applied to the Qwen-Audio series, as well as the recently introduced Mini-Omni and LLaMA-Omni series. For a comparison with Whisper, please refer to our previous response.
It is worth noting that, in contrast to Whisper, we believe that codec-based approaches hold greater potential for the future. The current challenge appears to lie in the WavTokenizer's encoder, which is not yet powerful enough—a limitation that we are actively working to address.
hi there, great work! I just wondering that can wavtokenizer be compared with whisper? since the Qwen-audio series uses whisper as an audio encoder, can wavtokenizer be used as an alternative, and where are its advantages and disadvantages?
thanks
The text was updated successfully, but these errors were encountered: