-
Notifications
You must be signed in to change notification settings - Fork 278
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
insanely-fast-whisper backend #122
Comments
hi, thanks. Why do you need insanely fast whisper? As far as I know, it uses faster-whisper, same as ours. What ts_word function do you mean? can you give link to the line where it is specified? And yes, whisper-streaming needs word-level timestamps. |
thank you for your answer. it is in this line for faster-whisper backend: whisper_streaming/whisper_online.py Line 138 in 225f038
it is in this line for openai whisper backend: whisper_streaming/whisper_online.py Line 185 in 225f038
and i want to implement this function for faster whisper backend. |
Alright. ts_words is quite poorly documented here: whisper_streaming/whisper_online.py Line 80 in 225f038
Word is string. In faster-whisper, it may be a subword, like "space-delimited" can be in two parts: " space" and "-delimited", they should not be joined with a space: whisper_streaming/whisper_online.py Line 31 in 225f038
|
OK. I think the speed in insanely-fast-whisper is because of using large memory and batching. It's applicable only to the offline mode, you can chunk the whole long recording into small pieces and process them in parallel. In streaming mode, you can use batching like #55 and #42. It should speed a little but not too much. But anyway, feel free to try it and share your latency-quality test results compared to faster-whisper. Or make a PR and I may do the test. |
hi ,thanks for your great work!
i want to use the streaming mode with insanely fast whisper backend. i am adding this backend but i don't know what is the "ts_words" function? what is its utility and what it takes as input ?does the output of the whisper backend need to have timestamps?
can you please help me to understand this function?
any help is really appreciated.
The text was updated successfully, but these errors were encountered: