Skip to content

Latest commit

 

History

History
23 lines (11 loc) · 797 Bytes

README.md

File metadata and controls

23 lines (11 loc) · 797 Bytes

The Reentry model

A PyTorch implementation of the Selective Listening by Synchronizing Speech with Lips

Update:

Project Structure

/data: Scripts to pre-process the voxceleb2 dataset.

/pretrained_slsyn: The pre-trained SLSyn network to extract embeddings.

/src: The training scripts of the slsyn network and the reentry model.

Pretrained weights

If you need the pretrained SLSyn network weights, please email me using your organizational email id. My email address is pan_zexu at u.nus.edu