Skip to content

Latest commit

 

History

History
103 lines (74 loc) · 4.48 KB

SHARED.md

File metadata and controls

103 lines (74 loc) · 4.48 KB

Shared Model Cards

Prerequisites of using

  • This document is serving as a quick lookup table for the community training/finetuning result, with various language support.
  • The models in this repository are open source and are based on voluntary contributions from contributors.
  • The use of models must be conditioned on respect for the respective creators. The convenience brought comes from their efforts.

Welcome to share here

  • Have a pretrained/finetuned result: model checkpoint (pruned best to facilitate inference, i.e. leave only ema_model_state_dict) and corresponding vocab file (for tokenization).
  • Host a public huggingface model repository and upload the model related files.
  • Make a pull request adding a model card to the current page, i.e. src\f5_tts\infer\SHARED.md.

Supported Languages

Multilingual

F5-TTS Base @ pretrain @ zh & en

Model 🤗Hugging Face Data (Hours) Model License
F5-TTS Base ckpt & vocab Emilia 95K zh&en cc-by-nc-4.0
MODEL_CKPT: hf://SWivid/F5-TTS/F5TTS_Base/model_1200000.safetensors
VOCAB_FILE: hf://SWivid/F5-TTS/F5TTS_Base/vocab.txt

Other infos, e.g. Author info, Github repo, Link to some sampled results, Usage instruction, Tutorial (Blog, Video, etc.) ...

English

Finnish

Finnish Common_Voice Vox_Populi @ finetune @ fi

Model 🤗Hugging Face Data Model License
F5-TTS Finnish ckpt & vocab Common Voice, Vox Populi cc-by-nc-4.0
MODEL_CKPT: hf://AsmoKoskinen/F5-TTS_Finish_Model/model_common_voice_fi_vox_populi_fi_20241206.safetensors
VOCAB_FILE: hf://AsmoKoskinen/F5-TTS_Finish_Model/vocab.txt

French

French LibriVox @ finetune @ fr

Model 🤗Hugging Face Data (Hours) Model License
F5-TTS French ckpt & vocab LibriVox cc-by-nc-4.0
MODEL_CKPT: hf://RASPIAUDIO/F5-French-MixedSpeakers-reduced/model_last_reduced.pt
VOCAB_FILE: hf://RASPIAUDIO/F5-French-MixedSpeakers-reduced/vocab.txt

Japanese

F5-TTS Japanese @ pretrain/finetune @ ja

Model 🤗Hugging Face Data (Hours) Model License
F5-TTS Japanese ckpt & vocab Emilia 1.7k JA & Galgame Dataset 5.4k cc-by-nc-4.0
MODEL_CKPT: hf://Jmica/F5TTS/JA_8500000/model_8499660.pt
VOCAB_FILE: hf://Jmica/F5TTS/JA_8500000/vocab_updated.txt

Mandarin

Spanish

F5-TTS Spanish @ pretrain/finetune @ es

Model 🤗Hugging Face Data (Hours) Model License
F5-TTS Spanish ckpt & vocab Voxpopuli & Crowdsourced & TEDx, 218 hours cc0-1.0
  • @jpgallegoar GitHub repo, Jupyter Notebook and Gradio usage for Spanish model.