diff --git a/CHANGELOG.md b/CHANGELOG.md index 3545b0744..828fdb4fc 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -9,6 +9,9 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0 ### Added +- Added a new `WebsocketService` based class for TTS services, containing + base functions and retry logic. + - Added support for Google TTS Journey voices in `GoogleTTSService`. - Added `29-livekit-audio-chat.py`, as a new foundational examples for @@ -17,26 +20,36 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0 - Added `enable_prejoin_ui`, `max_participants` and `start_video_off` params to `DailyRoomProperties`. -- Added `session_timeout` to `FastAPIWebsocketTransport` and `WebsocketServerTransport` - for configuring session timeouts (in seconds). Triggers `on_session_timeout` for custom timeout handling. - See [examples/websocket-server/bot.py](https://github.com/pipecat-ai/pipecat/blob/main/examples/websocket-server/bot.py). +- Added `session_timeout` to `FastAPIWebsocketTransport` and + `WebsocketServerTransport` for configuring session timeouts (in seconds). + Triggers `on_session_timeout` for custom timeout handling. See + [examples/websocket-server/bot.py](https://github.com/pipecat-ai/pipecat/blob/main/examples/websocket-server/bot.py). -- Added the new modalities option and helper function to set Gemini output modalities. +- Added the new modalities option and helper function to set Gemini output + modalities. -- Added `examples/foundational/26d-gemini-multimodal-live-text.py` which is using Gemini as TEXT modality and using another TTS provider for TTS process. +- Added `examples/foundational/26d-gemini-multimodal-live-text.py` which is + using Gemini as TEXT modality and using another TTS provider for TTS process. ### Changed -- api_key, aws_access_key_id and region are no longer required parameters for the PollyTTSService (AWSTTSService) -- Added `session_timeout` example in `examples/websocket-server/bot.py` to handle session timeout event. -- Changed `InputParams` in `src/pipecat/services/gemini_multimodal_live/gemini.py` to support different modalities. +- api_key, aws_access_key_id and region are no longer required parameters for + the PollyTTSService (AWSTTSService). +- Added `session_timeout` example in `examples/websocket-server/bot.py` to + handle session timeout event. +- Changed `InputParams` in `src/pipecat/services/gemini_multimodal_live/gemini.py` + to support different modalities. ### Fixed +- Fixed an issue where websocket based TTS services could incorrectly terminate + their connection due to a retry counter not resetting. + - Fixed an issue where `OpenAIRealtimeBetaLLMService` audio chunks were hitting an error when truncating audio content. -- Fixed an issue where setting the voice and model for `RimeHttpTTSService` wasn't working. +- Fixed an issue where setting the voice and model for `RimeHttpTTSService` + wasn't working. ## [0.0.52] - 2024-12-24 diff --git a/src/pipecat/services/cartesia.py b/src/pipecat/services/cartesia.py index 2a2f0e00b..ed6f4145e 100644 --- a/src/pipecat/services/cartesia.py +++ b/src/pipecat/services/cartesia.py @@ -4,7 +4,6 @@ # SPDX-License-Identifier: BSD 2-Clause License # -import asyncio import base64 import json import uuid diff --git a/src/pipecat/services/playht.py b/src/pipecat/services/playht.py index ae2128197..ae5a19f82 100644 --- a/src/pipecat/services/playht.py +++ b/src/pipecat/services/playht.py @@ -4,7 +4,6 @@ # SPDX-License-Identifier: BSD 2-Clause License # -import asyncio import io import json import struct