Skip to content

some codes and data for spoken language recognition based on the Mozilla Common Voice dataset

Notifications You must be signed in to change notification settings

sergeyvilov/MCV-spoken-language-recognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MCV-spoken-language-recognition

mix_40000_records.tsv.gz - train/test split for spoken language recognition on Mozilla Common Voice dataset

audio_models.py - models used for spoken language recognition on Mozilla Common Voice dataset

tuplemax_loss.py - Pytorch implementation of Tuplemax loss from Wan, Li, et al. "Tuplemax loss for language identification." ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019.

See my article about spoken language recognition on Medium:

  1. Data selection and preprocessing
  2. Choosing optimal model
  3. Audio transformations

About

some codes and data for spoken language recognition based on the Mozilla Common Voice dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages