Bangla Text to Speech

Pre-release
@zabir-nabil zabir-nabil released this 03 Sep 09:15
· 17 commits to master since this release
6754e7b

A multilingual (Bangla, English) speech synthesis library that runs in near real time on a GPU.

Features:

  • It works offline.
  • It works on both CPU and GPU (inference is faster on GPU).
  • It uses a simple parser that converts numerals to their spoken-word form. Example: ১৯৯৭ সালের ২১ জানুয়ারী তে আমার জন্ম হয় is converted to ['ঊনিশশ সাতানব্বই সালের একুশ জানুয়ারী তে আমার জন্ম হয় '] by the parser.
  • It can synthesize arbitrarily long sentences with a simple segment-batch-merge trick.
  • It's free to use. Its performance is not state of the art, but for Bangla it outperforms almost all parametric models.
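
The segment-batch-merge idea from the feature list can be sketched roughly as follows. This is only an illustration under stated assumptions, not the library's actual code: the function names (`split_into_segments`, `make_batches`, `synthesize_long`), the `max_chars` and `batch_size` parameters, and the stand-in `fake_synthesize_batch` model are all hypothetical. The text is split into short segments, the segments are grouped into batches, each batch is passed to a synthesizer, and the resulting waveforms are concatenated in order.

```python
import re
from typing import Callable, List, Sequence

Waveform = List[float]


def split_into_segments(text: str, max_chars: int = 60) -> List[str]:
    """Split on sentence punctuation, then pack words into short chunks.

    A chunk exceeds max_chars only when a single word is longer than that.
    """
    # Split on the Bangla danda and common Latin sentence terminators.
    sentences = [s.strip() for s in re.split(r"[।.!?]+", text) if s.strip()]
    segments: List[str] = []
    for sentence in sentences:
        current = ""
        for word in sentence.split():
            candidate = f"{current} {word}".strip()
            if current and len(candidate) > max_chars:
                segments.append(current)
                current = word
            else:
                current = candidate
        if current:
            segments.append(current)
    return segments


def make_batches(items: Sequence[str], batch_size: int) -> List[List[str]]:
    """Group segments into consecutive batches of at most batch_size."""
    return [list(items[i:i + batch_size]) for i in range(0, len(items), batch_size)]


def synthesize_long(
    text: str,
    synthesize_batch: Callable[[List[str]], List[Waveform]],
    batch_size: int = 4,
    max_chars: int = 60,
) -> Waveform:
    """Segment the text, synthesize batch by batch, and merge in order."""
    merged: Waveform = []
    for batch in make_batches(split_into_segments(text, max_chars), batch_size):
        for waveform in synthesize_batch(batch):
            merged.extend(waveform)
    return merged


def fake_synthesize_batch(batch: List[str]) -> List[Waveform]:
    """Hypothetical stand-in for a TTS model: one silent sample per character."""
    return [[0.0] * len(segment) for segment in batch]


if __name__ == "__main__":
    audio = synthesize_long("আমি ভাত খাই। I like tea.", fake_synthesize_batch)
    print(len(audio))
```

A real synthesizer would replace `fake_synthesize_batch`; a practical merge step might also insert short pauses between segments, which this sketch omits.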