
Add LiveKit audio transport #467

Merged
7 commits merged into pipecat-ai:main on Sep 27, 2024

Conversation

Contributor

@joachimchauvet joachimchauvet commented Sep 17, 2024

This might not be perfect yet, but I would love to get some initial feedback.
It looks like there are quite a few people interested (see #325).

try:
    from livekit import rtc
except ModuleNotFoundError as e:
    logger.error(f"Exception: {e}")
    logger.error("In order to use LiveKit, you need to `pip install pipecat-ai[livekit]`.")

We probably want to add livekit to pyproject.toml with the right dependencies.

@aconchillo
Contributor

This looks really good! I'll take a final look later, thank you!!!

@cyrilS-dev
Contributor

This looks amazing, @joachimchauvet! I'm particularly interested in this PR as I'm using LiveKit.
I will give it a try as soon as possible and provide feedback. Looking forward to testing it out!
Many thanks

@cyrilS-dev
Contributor

@joachimchauvet I just tested it and I can't play the audio when I join the room using the LK playground. I join the same room as my agent, both participants are active in the room, and the room events are consistent, but the 'Agent connected' status on the playground remains pending, as does the audio track.

@joachimchauvet
Contributor Author

@joachimchauvet I just tested it and I can't play the audio when I join the room using the LK playground. I join the same room as my agent, both participants are active in the room, and the room events are consistent, but the 'Agent connected' status on the playground remains pending, as does the audio track.

It seems to work fine on my side.
However, I noticed that I forgot to adjust the sample rate in https://github.com/joachimchauvet/pipecat-livekit/blob/main/examples/foundational/01b-livekit-audio.py
Could it be that the voice was playing so fast that you did not really hear it if you were using that example?
Are you using a different token for the bot and the user?
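For reference, here is a minimal sketch of issuing separate tokens for the bot and the user with the livekit-api Python package (the identities, room name, and environment variables below are illustrative placeholders, not part of this PR):

# Sketch only: issue distinct LiveKit tokens for the bot and the human user.
# Assumes the livekit-api package; identities and room name are placeholders.
import os

from livekit import api


def make_token(identity: str, room: str) -> str:
    return (
        api.AccessToken(os.environ["LIVEKIT_API_KEY"], os.environ["LIVEKIT_API_SECRET"])
        .with_identity(identity)  # must differ between the bot and the user
        .with_grants(api.VideoGrants(room_join=True, room=room))
        .to_jwt()
    )


bot_token = make_token("pipecat-bot", "my-room")
user_token = make_token("playground-user", "my-room")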

@aconchillo
Contributor

This looks great! Please rebase and make sure the pipeline is green. We are now using Ruff as our formatter. Make sure everything is green and I think we can merge after that. Thank you! 🙏

@cyrilS-dev
Contributor

cyrilS-dev commented Sep 23, 2024

@joachimchauvet I encountered an error while using the LiveKitInputTransport in my application:

Error in audio input task: 'LiveKitInputTransport' object has no attribute '_internal_push_frame'

The _internal_push_frame method is not defined in the LiveKitInputTransport class.

@joachimchauvet
Contributor Author

@joachimchauvet I encountered an error while using the LiveKitInputTransport in my application:

Error in audio input task: 'LiveKitInputTransport' object has no attribute '_internal_push_frame'

The _internal_push_frame method is not defined in the LiveKitInputTransport class.

Were you doing something special or did you make changes to the LiveKitInputTransport? It inherits from BaseInputTransport, so _internal_push_frame should be defined.
The method works fine on my side 🤔

@cyrilS-dev
Contributor

cyrilS-dev commented Sep 24, 2024

@joachimchauvet I encountered an error while using the LiveKitInputTransport in my application:
Error in audio input task: 'LiveKitInputTransport' object has no attribute '_internal_push_frame'
The _internal_push_frame method is not defined in the LiveKitInputTransport class.

Were you doing something special or did you make changes to the LiveKitInputTransport? It inherits from BaseInputTransport, so _internal_push_frame should be defined. The method works fine on my side 🤔

There has been no _internal_push_frame method in BaseInputTransport since #436.

@cyrilS-dev
Contributor

@joachimchauvet I encountered an error in the _audio_in_task_handler method of the LiveKitInputTransport class, using Deepgram as STT:

AttributeError: 'AsyncListenWebSocketClient' object has no attribute '_socket'
2024-09-25 15:49:15.863 | ERROR    | pipecat.processors.frame_processor:push_frame:203 - Uncaught exception in LiveKitInputTransport#0: 'AsyncListenWebSocketClient' object has no attribute '_socket'

The issue arises when await self.push_frame(pipecat_audio_frame) attempts to push frames before the Deepgram WebSocket connection is fully initialized.

@aconchillo
Contributor

@joachimchauvet I encountered an error in the _audio_in_task_handler method of the LiveKitInputTransport class, using Deepgram as STT:

AttributeError: 'AsyncListenWebSocketClient' object has no attribute '_socket'
2024-09-25 15:49:15.863 | ERROR    | pipecat.processors.frame_processor:push_frame:203 - Uncaught exception in LiveKitInputTransport#0: 'AsyncListenWebSocketClient' object has no attribute '_socket'

The issue arises when await self.push_frame(pipecat_audio_frame) attempts to push frames before the Deepgram WebSocket connection is fully initialized.

In theory, all processors should receive the StartFrame first (this is now guaranteed in main) and should make sure any needed connection is established then, so it's safe to push frames afterwards.

It's possible that this branch hasn't been rebased, so it doesn't have that StartFrame guarantee. Would it be possible for you to rebase locally and try that? I could be wrong and maybe it's something else...
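To illustrate the pattern described above with a simplified, self-contained sketch (hypothetical names, not pipecat's actual transport code), an input transport can gate its audio loop on having handled the StartFrame so nothing is pushed downstream before connections are ready:

# Simplified illustration of the StartFrame-first pattern; class, method and
# frame names here are hypothetical, not pipecat's real API.
import asyncio


class SketchInputTransport:
    def __init__(self):
        self._started = asyncio.Event()

    async def on_start_frame(self):
        # Establish any needed connections (e.g. the STT websocket) here,
        # then signal that it is safe to push frames downstream.
        self._started.set()

    async def audio_in_task(self, audio_queue: asyncio.Queue):
        # Block until the StartFrame has been handled before pushing anything.
        await self._started.wait()
        while True:
            frame = await audio_queue.get()
            await self.push_frame(frame)

    async def push_frame(self, frame):
        print("pushing", frame)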

@cyrilS-dev
Contributor

cyrilS-dev commented Sep 25, 2024

@joachimchauvet I encountered an error in the _audio_in_task_handler method of the LiveKitInputTransport class, using Deepgram as STT:

AttributeError: 'AsyncListenWebSocketClient' object has no attribute '_socket'
2024-09-25 15:49:15.863 | ERROR    | pipecat.processors.frame_processor:push_frame:203 - Uncaught exception in LiveKitInputTransport#0: 'AsyncListenWebSocketClient' object has no attribute '_socket'

The issue arises when await self.push_frame(pipecat_audio_frame) attempts to push frames before the Deepgram WebSocket connection is fully initialized.

In theory, all processors should receive the StartFrame first (this is now guaranteed in main) and should make sure any needed connection is established then, so it's safe to push frames afterwards.

It's possible that this branch hasn't been rebased, so it doesn't have that StartFrame guarantee. Would it be possible for you to rebase locally and try that? I could be wrong and maybe it's something else...

You're absolutely right @aconchillo. Thanks for pointing that out!

Now, regarding this part:

async def send_metrics(self, frame: MetricsFrame):
    metrics = {}
    if frame.ttfb:
        metrics["ttfb"] = frame.ttfb
    if frame.processing:
        metrics["processing"] = frame.processing
    if hasattr(frame, "tokens"):
        metrics["tokens"] = frame.tokens
    if hasattr(frame, "characters"):
        metrics["characters"] = frame.characters

    message = LiveKitTransportMessageFrame(
        message={"type": "pipecat-metrics", "metrics": metrics}
    )
    await self._client.send_data(str(message.message).encode())

This needs to be updated due to changes introduced in #474

@cyrilS-dev
Contributor

Everything is running perfectly. Just one detail: LiveKitInputTransport stops when the room is disconnected, but LiveKitOutputTransport doesn't, which triggers an error from the TTS service after a while: no close frame received or sent.

@aconchillo
Contributor

Everything is running perfectly. Just one detail: LiveKitInputTransport stops when the room is disconnected, but LiveKitOutputTransport doesn't, which triggers an error from the TTS service after a while: no close frame received or sent.

I'll wait on @joachimchauvet to answer this before merging.

@aconchillo
Contributor

I'll go ahead and merge this one. If there's any error you can provide a fix later. Thank you!!!

@aconchillo aconchillo merged commit 4501dca into pipecat-ai:main Sep 27, 2024
3 checks passed
@joachimchauvet
Contributor Author

Everything is running perfectly. Just one detail: LiveKitInputTransport stops when the room is disconnected, but LiveKitOutputTransport doesn't, which triggers an error from the TTS service after a while: no close frame received or sent.

I don't have this error with my agents. I have an idea where that might come from, but I was not able to reproduce the no close frame received or sent error directly. @cyrilS-dev could you share a pipeline that triggers this error?

@cyrilS-dev
Contributor

cyrilS-dev commented Sep 28, 2024

@joachimchauvet The execution is blocked when calling await super().stop(frame) in the stop method of the LiveKitOutputTransport class.

The stop method in the BaseOutputTransport class is causing the execution to hang due to:

if self._sink_task:
    await self._sink_task
if self._sink_clock_task:
    await self._sink_clock_task

As a result, the await self._client.disconnect() call is never reached.

@aconchillo do you have any insights on this? Thanks
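For reference, one possible mitigation shape (a generic sketch with placeholder names, not pipecat's API and not the project's actual fix) is to bound the wait on the sink task so a task that never finishes after a room disconnect cannot block stop(), and then disconnect the client:

# Generic sketch: placeholder names, not pipecat's actual API.
import asyncio


async def stop_output(sink_task: asyncio.Task, disconnect, timeout: float = 5.0):
    try:
        # wait_for cancels the task if it does not finish within the timeout.
        await asyncio.wait_for(sink_task, timeout)
    except asyncio.TimeoutError:
        pass
    # The client disconnect is now always reached.
    await disconnect()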
