Waves offers cutting-edge text-to-speech technology featuring hyper-realistic voices that sound virtually indistinguishable from human speech. With incredibly low latency of just 100ms, it enables real-time applications across various domains. The system supports multiple languages and accents. This repository shows a collection of example projects, each in its own directory, demonstrating how to implement and leverage these advanced models for different use cases. Comprehensive setup instructions are provided, making it easy for developers to integrate Waves' powerful text-to-speech capabilities into their own applications.
Lightning is the world’s fastest text to speech model, generating around 10 seconds of hyper-realistic audio in just 100ms, all at once, no streaming.
It supports 7 Voices, learn more about it on 🌊 Docs.
Thunder is streaming text to speech model with gives hyper realistic audios with time-to-first-bytes as low as 200ms.
It supports 20 Voices, learn more about it on 🌊 Docs.
- Get the 🌊 API Key
git clone https://github.com/smallest-inc/waves-examples.git
- Navigate to the specific example directory you're interested in.
- Follow the instructions in the local README.md file to set up and run the example.
We welcome contributions from the community! If you’d like to contribute, you can file issues under this repo, open a PR, or chat with us in smallest.ai's Discord Community.
Our TTS models are being integrated into various platforms and tools. Keep an eye on the following repositories for official integrations and updates:
For questions, issues, or support:
- Email: [email protected]
- Join our community: Discord