A Fundamental End-to-End Speech Recognition Toolkit with Open-Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
pytorch
speech-recognition
vad
punctuation
whisper
audio-visual-speech-recognition
speaker-diarization
voice-activity-detection
conformer
pretrained-model
rnnt
dfsmn
paraformer
speechgpt
speechllm
Updated Nov 30, 2024 - Python