A ComfyUI custom node for audio subtitling based on WhisperX and translators

naiver-me/ComfyUI-WhisperX-Plus

ComfyUI-WhisperX

webpage

Features

  • Subtitles can be exported as SRT files
  • Translation is supported through translators, which provides a large number of translation engines
  • Multi-speaker diarization is supported through pyannote-audio
  • Many other ComfyUI custom nodes can be combined with WhisperX
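
To illustrate the SRT export feature, here is a minimal sketch of turning transcription segments into SRT text. It assumes each segment is a dict with `start`, `end` (in seconds) and `text` keys, plus an optional `speaker` key when diarization is enabled. The function names `format_timestamp` and `segments_to_srt` are hypothetical and are not taken from this repository.

```python
from typing import Dict, List


def format_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: List[Dict]) -> str:
    """Render segments (assumed keys: 'start', 'end', 'text') as SRT text."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        text = seg["text"].strip()
        # Prefix the speaker label when diarization supplied one.
        speaker = seg.get("speaker")
        if speaker:
            text = f"[{speaker}] {text}"
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(seg['start'])} --> {format_timestamp(seg['end'])}\n"
            f"{text}\n"
        )
    # SRT cues are separated by a blank line.
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 1.25, "text": " Hello there. ", "speaker": "SPEAKER_00"},
        {"start": 1.5, "end": 3.0, "text": "Hi!"},
    ]
    print(segments_to_srt(demo))
```

The node's actual export code may differ; this only shows the shape of the SRT format (cue index, time range, text).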

How to use

Make sure ffmpeg works from your command line. On Linux:

apt update
apt install ffmpeg

On Windows, you can install ffmpeg automatically with WingetUI.
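
On either platform, you can confirm that ffmpeg is reachable before going further. The following is a small sketch using only the Python standard library; the helper names `find_ffmpeg` and `ffmpeg_version` are hypothetical.

```python
import shutil
import subprocess
from typing import Optional


def find_ffmpeg() -> Optional[str]:
    """Return the full path of the ffmpeg executable, or None if it is not on PATH."""
    return shutil.which("ffmpeg")


def ffmpeg_version(path: str) -> str:
    """Return the first line printed by `ffmpeg -version`."""
    result = subprocess.run(
        [path, "-version"], capture_output=True, text=True, check=True
    )
    return result.stdout.splitlines()[0]


if __name__ == "__main__":
    ffmpeg_path = find_ffmpeg()
    if ffmpeg_path is None:
        print("ffmpeg was not found on PATH; install it before using the node.")
    else:
        print(ffmpeg_version(ffmpeg_path))
```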

Then:

git clone https://github.com/AIFSH/ComfyUI-WhisperX.git
cd ComfyUI-WhisperX
pip install -r requirements.txt

Model weights are downloaded from Hugging Face automatically. If you are in China, make sure your network can reach Hugging Face; if you still have trouble, try following hf-mirror to configure your environment.
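
As a rough sketch of the mirror setup, the snippet below sets the `HF_ENDPOINT` environment variable, which `huggingface_hub` reads to decide where to download from. The endpoint URL `https://hf-mirror.com` and the helper name `use_hf_mirror` are assumptions for illustration; check the hf-mirror instructions for the current value.

```python
import os

# Assumption: this is the mirror endpoint you want to use.
HF_MIRROR_ENDPOINT = "https://hf-mirror.com"


def use_hf_mirror(endpoint: str = HF_MIRROR_ENDPOINT) -> str:
    """Point huggingface_hub at a mirror unless HF_ENDPOINT is already set."""
    # setdefault keeps any value the user already exported in their shell.
    return os.environ.setdefault("HF_ENDPOINT", endpoint)


if __name__ == "__main__":
    print("Downloading from:", use_hf_mirror())
```

Call this before anything imports `huggingface_hub`, since the library reads the variable when it is first imported. Exporting `HF_ENDPOINT` in the shell that launches ComfyUI has the same effect.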

To use speaker diarization, you need to:

  1. Accept pyannote/segmentation-3.0 user conditions
  2. Accept pyannote/speaker-diarization-3.1 user conditions
  3. Create access token at hf.co/settings/tokens.
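
Once the three steps above are done, the token has to reach pyannote-audio. How this node receives the token is not described here, so the following is only a sketch under assumptions: the token is stored in an environment variable named `HF_TOKEN`, the helper names are hypothetical, and the `use_auth_token` parameter follows the pyannote/speaker-diarization-3.1 model card (newer pyannote-audio releases may name it differently).

```python
import os


def read_hf_token(env_var: str = "HF_TOKEN") -> str:
    """Read the Hugging Face access token from an environment variable."""
    token = os.environ.get(env_var, "").strip()
    if not token:
        raise RuntimeError(
            f"Set {env_var} to a token created at hf.co/settings/tokens "
            "after accepting the pyannote user conditions."
        )
    return token


def load_diarization_pipeline(token: str):
    """Load pyannote/speaker-diarization-3.1 with the given token."""
    # Imported lazily so the token check above works without pyannote installed.
    from pyannote.audio import Pipeline

    # Assumption: `use_auth_token` is the parameter name in your pyannote version.
    return Pipeline.from_pretrained(
        "pyannote/speaker-diarization-3.1", use_auth_token=token
    )


if __name__ == "__main__":
    pipeline = load_diarization_pipeline(read_hf_token())
    print("Diarization pipeline loaded:", pipeline is not None)
```

Keeping the token in an environment variable avoids saving it inside a workflow file that might be shared.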

Tutorial

Demo

QQ Group && Donate

QQ Group

Thanks
