Add speed and emotion options for Cartesia. #435

golbin · 2024-08-31T08:50:28Z

I've just added speed and emotion options to Cartesia TTS.

While these are experimental features, they are very useful and could soon be ready for production.

aconchillo · 2024-09-06T02:22:30Z

src/pipecat/services/cartesia.py

@@ -73,6 +73,8 @@ def __init__(
            encoding: str = "pcm_s16le",
            sample_rate: int = 16000,
            language: str = "en",
+            speed: str = None,
+            emotion: list = None,


I'm thinking that instead of adding more arguments we could do like FalImageGenService or GladiaSTTService:

class CartesiaTTSService(TTSService): class InputParams(BaseModel): speed: Optional[str] = None, emotion: Optional[List[str]] = None

In that case, wouldn’t it be better to include the other options like encoding, sample_rate, and language in the InputParams as well?

Yes, I just didn't want to ask you to do more work 😅

No problem, I’ve made a patch based on your advice.

The 'voice_id' parameter is required, so I’ve left it as the default parameter.

Please review and let me know if everything looks good.

Hmm… maybe we need something like TTSEmotionUpdateFrame and TTSSpeedUpdateFrame? 🤔

Not needed for now.

aconchillo · 2024-09-10T00:57:44Z

src/pipecat/services/cartesia.py

@@ -61,6 +62,14 @@ def language_to_cartesia_language(language: Language) -> str | None:


 class CartesiaTTSService(TTSService):
+    class InputParams(BaseModel):
+        model_id: Optional[str] = "sonic-english"


I'd keep the model_id as a main argument as with all the other AI services. The rest looks great!

There's also a conflict.

golbin · 2024-09-17T15:39:22Z

@aconchillo I fixed conflict and revert "model_id" as a main argument. Please check that.

aconchillo · 2024-09-20T21:49:06Z

Looks great! I believe there's one more conflict but I'll merge right away once that's solved. Thank you for your patience!

golbin · 2024-09-23T07:36:42Z

I'm sorry for the delay. The conflict has been resolved.

golbin · 2024-09-23T23:04:53Z

Apply and fix for recent changes. It might be better to make CartesiaTTS class for inheritance in CartesiaTTSService/CartesiaHttpService later.

williamtran29 · 2024-09-25T16:07:45Z

Can we merge this asap? the speed is too fast atm

markbackman · 2024-09-25T18:48:24Z

@golbin we've made a switch to Ruff as a formatter. This was to resolve formatting issues with Autopep-8. The README has been updated with instructions for setting up Ruff as a linter in your IDE. Can you set up Ruff, fix the linting issues, and push your latest changes? With that either @aconchillo or I can merge it down right away.

golbin · 2024-09-26T01:38:43Z

I will fix it in 24 hours.

markbackman · 2024-09-26T11:14:55Z

Thanks @golbin! 🙌

aconchillo reviewed Sep 6, 2024

View reviewed changes

Add voice options and make to use InputParams for Cartesia.

fa0deed

golbin force-pushed the main branch from 3c716f6 to fa0deed Compare September 9, 2024 01:58

aconchillo reviewed Sep 10, 2024

View reviewed changes

golbin added 2 commits September 18, 2024 00:33

Merge remote-tracking branch 'upstream/main'

c7f814b

Revert "model_id" as a main argument

2da0ecb

golbin mentioned this pull request Sep 17, 2024

add voice control for speed and emotions #461

Closed

Add speed and emotion setting method to Cartesia TTS service

75008d8

aconchillo approved these changes Sep 20, 2024

View reviewed changes

Merge remote-tracking branch 'upstream/main'

68cc418

golbin added 2 commits September 24, 2024 07:18

Merge remote-tracking branch 'upstream/main'

cf72129

Apply and Fix upstream changes for Cartesia

49f2123

markbackman self-requested a review September 25, 2024 18:48

Apply Ruff formater

d05717a

markbackman merged commit b1818cc into pipecat-ai:main Sep 26, 2024
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add speed and emotion options for Cartesia. #435

Add speed and emotion options for Cartesia. #435

golbin commented Aug 31, 2024

aconchillo Sep 6, 2024

golbin Sep 8, 2024

aconchillo Sep 8, 2024

golbin Sep 9, 2024

golbin Sep 9, 2024

aconchillo Sep 10, 2024

aconchillo Sep 10, 2024

aconchillo Sep 10, 2024

golbin commented Sep 17, 2024

aconchillo commented Sep 20, 2024

golbin commented Sep 23, 2024

golbin commented Sep 23, 2024 •

edited

Loading

williamtran29 commented Sep 25, 2024

markbackman commented Sep 25, 2024

golbin commented Sep 26, 2024

markbackman commented Sep 26, 2024

Add speed and emotion options for Cartesia. #435

Add speed and emotion options for Cartesia. #435

Conversation

golbin commented Aug 31, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

golbin commented Sep 17, 2024

aconchillo commented Sep 20, 2024

golbin commented Sep 23, 2024

golbin commented Sep 23, 2024 • edited Loading

williamtran29 commented Sep 25, 2024

markbackman commented Sep 25, 2024

golbin commented Sep 26, 2024

markbackman commented Sep 26, 2024

golbin commented Sep 23, 2024 •

edited

Loading