Skip to content
This repository has been archived by the owner on Nov 1, 2024. It is now read-only.

Latest commit

 

History

History
14 lines (8 loc) · 526 Bytes

README.md

File metadata and controls

14 lines (8 loc) · 526 Bytes

Speaker Diarization with Pyannote and Whisper.cpp

Uses Whisper.cpp to transcribe audio, and then performs speaker diarization with Pyannote.

Usage

Place video/audio files in input/, and then run main.py with docker compose up.

Notes

Performance for diarization seems to be improved when segment length for whisper is decreased, such as --max-len 50.