Audio file
demo.mov
Recognized speech text
ちょっと見ていきましょう。 (English: "Let's take a quick look.")
This model requires an additional module.
pip3 install librosa
The onnx and prototxt files are downloaded automatically on the first run. An Internet connection is required during the download.
To transcribe the sample wav, run:
$ python3 kotoba-whisper.py
To transcribe your own audio, pass the file path after the --input option.
$ python3 kotoba-whisper.py --input AUDIO_FILE
Use the --chunk_length option when transcribing a single large audio file.
$ python3 kotoba-whisper.py --chunk_length CHUNK_LENGTH
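As a rough illustration of what chunked transcription means, the sketch below splits a long sequence of audio samples into consecutive fixed-length pieces. It is not taken from kotoba-whisper.py: the helper name `split_into_chunks`, the assumption that the chunk length is given in seconds, and the 16 kHz sample rate are all hypothetical, and the actual script may overlap or stride its chunks differently.

```python
def split_into_chunks(samples, sample_rate, chunk_length_s):
    """Split audio samples into consecutive chunks of at most chunk_length_s seconds.

    Hypothetical helper for illustration only; it is not part of kotoba-whisper.py.
    """
    chunk_size = int(sample_rate * chunk_length_s)
    if chunk_size <= 0:
        raise ValueError("chunk_length_s * sample_rate must be at least one sample")
    # Slice without overlap; the final chunk may be shorter than the others.
    return [samples[i:i + chunk_size] for i in range(0, len(samples), chunk_size)]


if __name__ == "__main__":
    sample_rate = 16000                     # assumed sample rate for this example
    samples = [0.0] * (sample_rate * 70)    # 70 seconds of silence as stand-in audio
    chunks = split_into_chunks(samples, sample_rate, 30)
    print([len(c) / sample_rate for c in chunks])  # prints [30.0, 30.0, 10.0]
```

Each chunk could then be transcribed on its own and the resulting texts joined, which keeps memory use bounded for long recordings.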
PyTorch
ONNX opset=17
kotoba-whisper-v1.0_encoder.onnx.prototxt
kotoba-whisper-v1.0_decoder.onnx.prototxt