Name		Name	Last commit message	Last commit date
parent directory ..
tokenizer		tokenizer
LICENSE		LICENSE
README.md		README.md
audio_utils.py		audio_utils.py
demo.wav		demo.wav
kotoba-whisper.py		kotoba-whisper.py
tokenizer_utils.py		tokenizer_utils.py

README.md

Kotoba-Whisper

Input

Audio file

demo.mov

Output

Recognized speech text

ちょっと見ていきましょう。

Requirements

This model requires additional module.

pip3 install librosa

Usage

Automatically downloads the onnx and prototxt files on the first run. It is necessary to be connected to the Internet while downloading.

For the sample wav,

$ python3 kotoba-whisper.py

If you want to specify the audio, put the file path after the --input option.

$ python3 kotoba-whisper.py --input AUDIO_FILE

This --chunk_length option should be used when a single large audio file is being transcribed.

$ python3 kotoba-whisper.py --chunk_length CHUNK_LENGTH

Reference

Hugging Face - Kotoba-Whisper

Framework

Pytorch

Model Format

ONNX opset=17

Netron

kotoba-whisper-v1.0_encoder.onnx.prototxt
kotoba-whisper-v1.0_decoder.onnx.prototxt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kotoba-whisper

kotoba-whisper

README.md

Kotoba-Whisper

Input

Output

Requirements

Usage

Reference

Framework

Model Format

Netron

Files

kotoba-whisper

Directory actions

More options

Directory actions

More options

Latest commit

History

kotoba-whisper

Folders and files

parent directory

README.md

Kotoba-Whisper

Input

Output

Requirements

Usage

Reference

Framework

Model Format

Netron