StyleTTS2 Inferencer

This Python package is a wrapper around the StyleTTS2 text-to-speech engine by Aaron (Yinghao) Li et al. It currently supports converting a given text into speech and writing the result to a specified audio file.

How to install:

sudo apt-get install espeak-ng git git-lfs
git lfs install
pip install .

The model weights are downloaded automatically on the first run.
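
Before the first run it can help to confirm that the system dependencies from the install steps are on the PATH. The sketch below is a convenience check under stated assumptions, not part of the package: the helper name `missing_tools` is hypothetical, and it assumes the espeak-ng package provides an executable named `espeak-ng`.

```python
import shutil


def missing_tools(tools=("espeak-ng", "git")):
    """Return the names in `tools` that cannot be found on the PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing system dependencies:", ", ".join(missing))
    else:
        print("All system dependencies found.")
```

An empty result means every listed tool was found; anything else names what still needs installing.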

Run with:

from styletts2_inferencer import StyleTTS2Inferencer

tts = StyleTTS2Inferencer() # defaults to using cuda/gpu
tts.tts_to_file(
    text="Hello! How are you?", # automatically switches to long form generation for multiline texts.
    filepath="test.wav" # should be an absolute path
)
# Output: 
# RTF = 0.410011
# Written output to file test.wav
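
The comment on `filepath` asks for an absolute path, while the example passes a relative one. A minimal sketch of one way to reconcile the two, assuming only the `StyleTTS2Inferencer()` constructor and the `tts_to_file(text=..., filepath=...)` call shown above. The helper names `absolute_output_path` and `synthesize` are hypothetical and not part of the package.

```python
from pathlib import Path


def absolute_output_path(filepath):
    """Resolve `filepath` against the current working directory."""
    return str(Path(filepath).expanduser().resolve())


def synthesize(text, filepath):
    """Write `text` as speech to `filepath`, passing an absolute path."""
    # Imported lazily so the path helper works without the package installed.
    from styletts2_inferencer import StyleTTS2Inferencer

    tts = StyleTTS2Inferencer()
    out_path = absolute_output_path(filepath)
    tts.tts_to_file(text=text, filepath=out_path)
    return out_path
```

Calling `synthesize("Hello! How are you?", "test.wav")` would then write to `test.wav` in the current working directory and return the resolved path.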

See test.wav for an example output.
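
The `RTF` value printed in the example output is presumably the real-time factor, conventionally defined as synthesis time divided by the duration of the generated audio, so values below 1 mean faster than real time. The sketch below shows that conventional calculation for a PCM WAV file using only the standard library. It is an assumption about what the number means, not the package's own code, and `wav_duration_seconds` and `real_time_factor` are hypothetical names.

```python
import wave


def wav_duration_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def real_time_factor(synthesis_seconds, audio_seconds):
    """Conventional RTF: processing time divided by audio duration."""
    if audio_seconds <= 0:
        raise ValueError("audio duration must be positive")
    return synthesis_seconds / audio_seconds
```

Timing the `tts_to_file` call with `time.perf_counter()` and dividing by the duration of the written file gives a comparable figure, provided the file is PCM-encoded (the standard `wave` module cannot read other WAV encodings).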

Improvements to make

Add parameters for speech expressiveness and the number of diffusion steps, as shown in the demo.
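
One possible shape for these parameters, sketched under assumptions: in the upstream StyleTTS2 demo, `diffusion_steps` sets the number of diffusion sampling steps and `embedding_scale` influences how expressive the speech sounds. The `SynthesisOptions` class below is hypothetical and does not exist in this package, and its defaults (5 and 1.0) are assumed from the demo rather than taken from this repository.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SynthesisOptions:
    """Hypothetical container for the two planned parameters."""

    diffusion_steps: int = 5      # assumed default; more steps is slower
    embedding_scale: float = 1.0  # assumed default; higher is more expressive

    def __post_init__(self):
        # Reject values that cannot describe a valid sampling run.
        if self.diffusion_steps < 1:
            raise ValueError("diffusion_steps must be at least 1")
        if self.embedding_scale <= 0:
            raise ValueError("embedding_scale must be positive")
```

Grouping the values in one validated object would let `tts_to_file` accept a single optional argument instead of several loosely related keywords, though that is a design choice for the maintainer.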

Repository: thaije/Styletts2_inferencer
