Skip to content

Commit

Permalink
Update CHANGELOG
Browse files Browse the repository at this point in the history
  • Loading branch information
markbackman committed Jan 10, 2025
1 parent e0124df commit 4769663
Show file tree
Hide file tree
Showing 3 changed files with 22 additions and 11 deletions.
31 changes: 22 additions & 9 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,9 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0

### Added

- Added a new `WebsocketService` based class for TTS services, containing
base functions and retry logic.

- Added support for Google TTS Journey voices in `GoogleTTSService`.

- Added `29-livekit-audio-chat.py`, as a new foundational examples for
Expand All @@ -17,26 +20,36 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
- Added `enable_prejoin_ui`, `max_participants` and `start_video_off` params
to `DailyRoomProperties`.

- Added `session_timeout` to `FastAPIWebsocketTransport` and `WebsocketServerTransport`
for configuring session timeouts (in seconds). Triggers `on_session_timeout` for custom timeout handling.
See [examples/websocket-server/bot.py](https://github.com/pipecat-ai/pipecat/blob/main/examples/websocket-server/bot.py).
- Added `session_timeout` to `FastAPIWebsocketTransport` and
`WebsocketServerTransport` for configuring session timeouts (in seconds).
Triggers `on_session_timeout` for custom timeout handling. See
[examples/websocket-server/bot.py](https://github.com/pipecat-ai/pipecat/blob/main/examples/websocket-server/bot.py).

- Added the new modalities option and helper function to set Gemini output modalities.
- Added the new modalities option and helper function to set Gemini output
modalities.

- Added `examples/foundational/26d-gemini-multimodal-live-text.py` which is using Gemini as TEXT modality and using another TTS provider for TTS process.
- Added `examples/foundational/26d-gemini-multimodal-live-text.py` which is
using Gemini as TEXT modality and using another TTS provider for TTS process.

### Changed

- api_key, aws_access_key_id and region are no longer required parameters for the PollyTTSService (AWSTTSService)
- Added `session_timeout` example in `examples/websocket-server/bot.py` to handle session timeout event.
- Changed `InputParams` in `src/pipecat/services/gemini_multimodal_live/gemini.py` to support different modalities.
- api_key, aws_access_key_id and region are no longer required parameters for
the PollyTTSService (AWSTTSService).
- Added `session_timeout` example in `examples/websocket-server/bot.py` to
handle session timeout event.
- Changed `InputParams` in `src/pipecat/services/gemini_multimodal_live/gemini.py`
to support different modalities.

### Fixed

- Fixed an issue where websocket based TTS services could incorrectly terminate
their connection due to a retry counter not resetting.

- Fixed an issue where `OpenAIRealtimeBetaLLMService` audio chunks were hitting
an error when truncating audio content.

- Fixed an issue where setting the voice and model for `RimeHttpTTSService` wasn't working.
- Fixed an issue where setting the voice and model for `RimeHttpTTSService`
wasn't working.

## [0.0.52] - 2024-12-24

Expand Down
1 change: 0 additions & 1 deletion src/pipecat/services/cartesia.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,6 @@
# SPDX-License-Identifier: BSD 2-Clause License
#

import asyncio
import base64
import json
import uuid
Expand Down
1 change: 0 additions & 1 deletion src/pipecat/services/playht.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,6 @@
# SPDX-License-Identifier: BSD 2-Clause License
#

import asyncio
import io
import json
import struct
Expand Down

0 comments on commit 4769663

Please sign in to comment.