feat: add support for base64 encoded audio files #380

mdimado · 2025-01-21T06:33:41Z

Purpose

Add support for base64 encoded files

Proposed Changes

remove audio blocking check
add base64 audio file handling in MultimodalMessage
add audio content entries with audio/wav mime type

Issues

Fixes ensure api complience with audio for multimodal messages #373

rachwalk

Thank you for the contribution, I will approve the changes once the tests pass and you address the comments I made. Also please git rebase onto development, so the newest features are included. Do you think you'd be interested in working on #374? Especially the 2. point - ensuring that LLMs can accept this audio format. For now it would be okay, if only .wav is supported, the audio conversion functions can be done later.

rachwalk · 2025-01-21T10:23:12Z

src/rai/rai/messages/multimodal.py

+        # remove the audio blocking check
+        # if self.audios not in [None, []]:
+        #     raise ValueError("Audio is not yet supported")


If you remove a block of code please don't leave it as a comment. This will help maintain the codebase clean.

my bad, I will not leave it as a comment. I will remove it in the next pr

src/rai/rai/messages/multimodal.py

rachwalk · 2025-01-21T10:39:14Z

It also seems that there were tests which checked for audio not being supported. Since your PR adds support for audio it would be beneficial if you updated these tests too.

Co-authored-by: Kajetan Rachwał <[email protected]>

mdimado · 2025-01-22T06:23:30Z

alright, I'll create a pr in with the updated tests

rachwalk · 2025-01-24T12:29:38Z

@mdimado Can I close this PR in favour of #382?

mdimado · 2025-01-24T12:30:55Z

@mdimado Can I close this PR in favour of #382?

Sure

rachwalk · 2025-01-24T12:36:50Z

Closing in favour of #382

feat: add support for base64 encoded audio files

ed872c4

rachwalk reviewed Jan 21, 2025

View reviewed changes

Update src/rai/rai/messages/multimodal.py

73ddef7

Co-authored-by: Kajetan Rachwał <[email protected]>

mdimado mentioned this pull request Jan 22, 2025

Test/add multimodal audio test #382

Open

rachwalk closed this Jan 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add support for base64 encoded audio files #380

feat: add support for base64 encoded audio files #380

mdimado commented Jan 21, 2025

rachwalk left a comment •

edited

Loading

rachwalk Jan 21, 2025

mdimado Jan 22, 2025

rachwalk commented Jan 21, 2025

mdimado commented Jan 22, 2025

rachwalk commented Jan 24, 2025

mdimado commented Jan 24, 2025

rachwalk commented Jan 24, 2025

feat: add support for base64 encoded audio files #380

feat: add support for base64 encoded audio files #380

Conversation

mdimado commented Jan 21, 2025

Purpose

Proposed Changes

Issues

rachwalk left a comment • edited Loading

Choose a reason for hiding this comment

rachwalk Jan 21, 2025

Choose a reason for hiding this comment

mdimado Jan 22, 2025

Choose a reason for hiding this comment

rachwalk commented Jan 21, 2025

mdimado commented Jan 22, 2025

rachwalk commented Jan 24, 2025

mdimado commented Jan 24, 2025

rachwalk commented Jan 24, 2025

rachwalk left a comment •

edited

Loading