Visual Keyword Spotting with Attention

This is the official implementation of the Transpotter paper. The code has been tested with Python version 3.6.8. Pre-trained checkpoints are also released.

Setup

pip install -r requirements.txt
Download the necessary checkpoints and the test set pickle files.
- cd checkpoints/
- sh download_models.sh

Feature extraction

Please follow the steps in this repository to extract the features for the LRS2, LRS3 test set. Please use the model trained on LRS2 + LRS3 for the feature extraction. The provided code and pre-trained models work with these features.

Computing the scores on LRS2 and LRS3 test sets

The following command is used to compute the scores mentioned in the last row of Table 1 of the paper

# LRS3
python test_and_score.py --data_root /path/to/lrs3/test/ --test_pkl_file checkpoints/lrs3_test.pkl --ckpt_path checkpoints/ft_lrs3.pth --localization

# LRS2
python test_and_score.py --data_root /path/to/lrs2/vid/ --test_pkl_file checkpoints/lrs2_test.pkl --ckpt_path checkpoints/ft_lrs2.pth --localization

Note:

--localization flag is only used to compute $mAP^{loc}$. The other metrics can be computed by not using this flag.
LRS3 test scores are off by 0.2 points than the ones mentioned in the paper, because of missing files.

Citation

Please cite the following paper if you find our work useful:

@inproceedings{prajwal2021visual,
  title={Visual Keyword Spotting with Attention},
  author={Prajwal, KR and Momeni, Liliane and Afouras, Triantafyllos and Zisserman, Andrew},
  booktitle={BMVC},
  year={2021}
}

Acknowledgements

We thank the author of The Annotated Transformer for the Transformer implementation.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
checkpoints		checkpoints
.gitignore		.gitignore
README.md		README.md
config.py		config.py
dataloader_test.py		dataloader_test.py
models.py		models.py
modules.py		modules.py
requirements.txt		requirements.txt
test_and_score.py		test_and_score.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Visual Keyword Spotting with Attention

Setup

Feature extraction

Computing the scores on LRS2 and LRS3 test sets

Note:

Citation

Acknowledgements

About

Languages

prajwalkr/transpotter

Folders and files

Latest commit

History

Repository files navigation

Visual Keyword Spotting with Attention

Setup

Feature extraction

Computing the scores on LRS2 and LRS3 test sets

Note:

Citation

Acknowledgements

About

Resources

Stars

Watchers

Forks

Languages