The Wav2Lip node is a custom node for ComfyUI that allows you to perform lip-syncing on videos using the Wav2Lip model. It takes an input video and an audio file and generates a lip-synced output video.
- Lip-syncing of videos using the Wav2Lip model
- Support for various face detection models
Inputs:
- `images`: Input video frames (required)
- `audio`: Input audio file (required)
- `mode`: Processing mode, either "sequential" or "repetitive" (default: "sequential")
- `face_detect_batch`: Batch size for face detection (default: 8)

Outputs:
- `images`: Lip-synced output video frames
- `audio`: Output audio file
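
As a rough illustration of what a face-detection batch size controls, the sketch below splits a sequence of frames into chunks of at most `face_detect_batch` items. The helper `iter_batches` is hypothetical and not part of the node, and the node's actual batching logic may differ.

```python
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def iter_batches(frames: Sequence[T], batch_size: int = 8) -> Iterator[List[T]]:
    """Yield consecutive chunks of at most `batch_size` frames (hypothetical helper)."""
    if batch_size < 1:
        raise ValueError("batch_size must be a positive integer")
    for start in range(0, len(frames), batch_size):
        yield list(frames[start:start + batch_size])


# 20 placeholder "frames" processed with the default batch size of 8.
placeholder_frames = list(range(20))
batch_sizes = [len(batch) for batch in iter_batches(placeholder_frames, batch_size=8)]
print(batch_sizes)  # [8, 8, 4]
```

A smaller batch size generally means more, smaller detection passes, which tends to lower peak memory use at the cost of speed.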
- Clone the repository into the `custom_nodes` folder:
  `git clone https://github.com/yourusername/wav2lip-comfyui.git`
- Install the required dependencies:
  `pip install -r requirements.txt`
To use the Wav2Lip node, you need to download the required models separately. Please follow these steps:
- Download the wav2lip model: -1-
- Place the `.pth` model file in the `custom_nodes\ComfyUI_wav2lip\Wav2Lip\checkpoints` folder.
- Start or restart ComfyUI.
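
To confirm that the model file ended up in the expected folder, a small check like the following can be run from the ComfyUI root directory. This is a minimal sketch: `find_checkpoints` is a hypothetical helper, not part of the node, and the folder path is the one named in the step above.

```python
from pathlib import Path
from typing import List


def find_checkpoints(checkpoint_dir: Path) -> List[Path]:
    """Return the .pth files in `checkpoint_dir`, sorted by name (hypothetical helper)."""
    if not checkpoint_dir.is_dir():
        return []
    return sorted(p for p in checkpoint_dir.glob("*.pth") if p.is_file())


if __name__ == "__main__":
    # Folder from the setup steps, relative to the ComfyUI root directory.
    checkpoint_dir = Path("custom_nodes") / "ComfyUI_wav2lip" / "Wav2Lip" / "checkpoints"
    found = find_checkpoints(checkpoint_dir)
    if found:
        print("Found model file(s):", ", ".join(p.name for p in found))
    else:
        print(f"No .pth file found in {checkpoint_dir}")
```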
- Add the Wav2Lip node to your ComfyUI workflow.
- Connect the input video frames and audio file to the corresponding inputs of the Wav2Lip node.
- Adjust the node settings according to your requirements:
  - Set the `mode` to "sequential" or "repetitive" based on your video processing needs.
  - Adjust the `face_detect_batch` size if needed.
- Execute the ComfyUI workflow to generate the lip-synced output video.
If the VideoCombine node enters an endless loop when audio is connected to it, replace the installed ComfyUI-VideoHelperSuite node with the fixed version, which is provided as a 7zip archive in the ComfyUI-VideoHelperSuite-fix folder of the repository:
- Open the ComfyUI-VideoHelperSuite-fix folder in the repository.
- Find the ComfyUI-VideoHelperSuite.7zip file in that folder.
- Extract the contents of ComfyUI-VideoHelperSuite.7zip to a temporary location on your computer.
- Locate the existing ComfyUI-VideoHelperSuite node in your ComfyUI/custom_nodes folder.
- Replace the existing ComfyUI-VideoHelperSuite node with the fixed one.
- Restart ComfyUI to apply the changes.
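
The replacement step can also be scripted. The sketch below assumes the archive has already been extracted; `replace_node`, the backup location, and the example paths in the comments are all hypothetical. It moves the installed folder to a backup outside `custom_nodes` (so the old copy cannot be loaded alongside the new one) and then copies the fixed folder into place.

```python
import shutil
from pathlib import Path


def replace_node(existing: Path, fixed: Path, backup_root: Path) -> Path:
    """Back up `existing` under `backup_root`, then copy `fixed` into its place.

    Hypothetical helper; returns the path where the backup is (or would be) stored.
    """
    if not fixed.is_dir():
        raise FileNotFoundError(f"Extracted fix not found: {fixed}")
    backup = backup_root / f"{existing.name}.bak"
    if backup.exists():
        raise FileExistsError(f"Backup already exists: {backup}")
    backup_root.mkdir(parents=True, exist_ok=True)
    if existing.exists():
        # Move the installed node out of the way instead of deleting it.
        shutil.move(str(existing), str(backup))
    # copytree requires that the destination does not exist, which holds after the move.
    shutil.copytree(fixed, existing)
    return backup


# Example call (paths are assumptions; adjust them to your installation):
# replace_node(
#     existing=Path("ComfyUI/custom_nodes/ComfyUI-VideoHelperSuite"),
#     fixed=Path("/tmp/extracted/ComfyUI-VideoHelperSuite"),
#     backup_root=Path("ComfyUI/node_backups"),
# )
```

Keeping the backup makes it straightforward to restore the original node if the fixed version does not behave as expected.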
Thanks to ArtemM, Wav2Lip, PIRenderer, GFP-GAN, GPEN, ganimation_replicate, and STIT for sharing their code.
- StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN (ECCV 2022)
- CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior (CVPR 2023)
- SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation (CVPR 2023)
- DPE: Disentanglement of Pose and Expression for General Video Portrait Editing (CVPR 2023)
- 3D GAN Inversion with Facial Symmetry Prior (CVPR 2023)
- T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations (CVPR 2023)