feat(drivers-prompt-openai):add audio input/output support to OpenAiChatPromptDriver #1617

collindutter · 2025-01-24T21:55:00Z

I have read and agree to the contributing guidelines.

Describe your changes

Added

Support for AudioArtifact inputs/outputs in OpenAiChatPromptDriver.

Issue ticket number and link

Closes #1601

codecov · 2025-01-29T23:27:51Z

Codecov Report

Attention: Patch coverage is 96.55172% with 3 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
...iptape/drivers/prompt/openai_chat_prompt_driver.py	93.33%	0 Missing and 2 partials ⚠️
griptape/tasks/prompt_task.py	80.00%	0 Missing and 1 partial ⚠️

📢 Thoughts on this report? Let us know!

collindutter · 2025-01-30T00:14:57Z

Still need docs, code is ready for review though.

vachillo · 2025-02-07T18:02:49Z

griptape/drivers/prompt/base_prompt_driver.py

@@ -177,6 +180,8 @@ def __process_stream(self, prompt_stack: PromptStack) -> Message:
                    delta_contents[content.index] = [content]
                if isinstance(content, TextDeltaMessageContent):
                    EventBus.publish_event(TextChunkEvent(token=content.text, index=content.index))
+                elif isinstance(content, AudioDeltaMessageContent) and content.data is not None:


what would be the scenario where data is None?

OpenAi sometimes sends chunks with no content. For example, a final chunk that contains the usage.

vachillo · 2025-02-07T18:05:52Z

griptape/drivers/prompt/openai_chat_prompt_driver.py

+            "modalities": self.modalities,
+            "audio": self.audio,


should these follow the same **(...) syntax?

They get ignored when using non-audio models. But I can see openai-compatible endpoints throwing a fit. I'll make conditional.

vachillo · 2025-02-07T18:11:19Z

griptape/drivers/prompt/openai_chat_prompt_driver.py

+                    elif (
+                        isinstance(content, AudioMessageContent)
+                        and message.is_assistant()
+                        and time.time() < content.artifact.meta.get("expires_at", float("inf"))


what is the scenario where there is no expires_at? are we just assuming no expires_at == try it anyway and see if its expired or not?

vachillo · 2025-02-07T18:13:36Z

griptape/drivers/prompt/openai_chat_prompt_driver.py

+            artifact = content.artifact
+
+            # We can't send the audio if it's expired.
+            if int(time.time()) > artifact.meta.get("expires_at", float("inf")):


opposite logic here. wondering if we should just always try to send the transcript if there is no expires_at. i also probably dont understand when certain fields are set or unset, and if they are allowed to be None or not

vachillo · 2025-02-07T18:16:14Z

griptape/tasks/actions_subtask.py

@@ -257,7 +265,7 @@ def __init_from_artifact(self, artifact: TextArtifact | ListArtifact) -> None:
            None
        """
        # When using native tools, we can assume that a TextArtifact is the LLM providing its final answer.


update comment

…hatPromptDriver

collindutter force-pushed the feature/openai-audio branch from 0ff40f8 to 375628e Compare January 24, 2025 21:55

collindutter mentioned this pull request Jan 29, 2025

Refactor ActionsSubtask for more precise initialization logic #1626

Merged

1 task

collindutter force-pushed the feature/openai-audio branch 2 times, most recently from efa0532 to 7c77f8d Compare January 29, 2025 23:25

collindutter force-pushed the feature/openai-audio branch 2 times, most recently from 5f9394f to 73e246a Compare January 30, 2025 00:03

collindutter requested a review from dylanholmes January 30, 2025 00:15

collindutter force-pushed the feature/openai-audio branch from 73e246a to be417c1 Compare January 30, 2025 00:15

collindutter requested a review from vachillo January 30, 2025 00:15

collindutter marked this pull request as ready for review January 30, 2025 00:15

collindutter force-pushed the feature/openai-audio branch 2 times, most recently from 442d68f to f26f6b3 Compare February 4, 2025 17:53

collindutter enabled auto-merge February 4, 2025 17:54

collindutter force-pushed the feature/openai-audio branch 2 times, most recently from 37f793f to f023372 Compare February 5, 2025 02:02

collindutter changed the title ~~OpenAi Audio Inputs/Outputs~~ feat(drivers-prompt-openai):add audio input/output support to OpenAiChatPromptDriver Feb 5, 2025

collindutter force-pushed the feature/openai-audio branch 2 times, most recently from bc9a49c to 85e5497 Compare February 7, 2025 16:19

vachillo reviewed Feb 7, 2025

View reviewed changes

collindutter force-pushed the feature/openai-audio branch from 85e5497 to e3a3850 Compare February 7, 2025 19:01

feat(drivers-prompt-openai):add audio input/output support to OpenAiC…

884a7ef

…hatPromptDriver

collindutter force-pushed the feature/openai-audio branch from e3a3850 to 884a7ef Compare February 7, 2025 19:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(drivers-prompt-openai):add audio input/output support to OpenAiChatPromptDriver #1617

feat(drivers-prompt-openai):add audio input/output support to OpenAiChatPromptDriver #1617

collindutter commented Jan 24, 2025 •

edited

Loading

codecov bot commented Jan 29, 2025 •

edited

Loading

collindutter commented Jan 30, 2025 •

edited

Loading

vachillo Feb 7, 2025

collindutter Feb 7, 2025

vachillo Feb 7, 2025

collindutter Feb 7, 2025

vachillo Feb 7, 2025

vachillo Feb 7, 2025

vachillo Feb 7, 2025

feat(drivers-prompt-openai):add audio input/output support to OpenAiChatPromptDriver #1617

Are you sure you want to change the base?

feat(drivers-prompt-openai):add audio input/output support to OpenAiChatPromptDriver #1617

Conversation

collindutter commented Jan 24, 2025 • edited Loading

Describe your changes

Added

Issue ticket number and link

codecov bot commented Jan 29, 2025 • edited Loading

Codecov Report

collindutter commented Jan 30, 2025 • edited Loading

vachillo Feb 7, 2025

Choose a reason for hiding this comment

collindutter Feb 7, 2025

Choose a reason for hiding this comment

vachillo Feb 7, 2025

Choose a reason for hiding this comment

collindutter Feb 7, 2025

Choose a reason for hiding this comment

vachillo Feb 7, 2025

Choose a reason for hiding this comment

vachillo Feb 7, 2025

Choose a reason for hiding this comment

vachillo Feb 7, 2025

Choose a reason for hiding this comment

collindutter commented Jan 24, 2025 •

edited

Loading

codecov bot commented Jan 29, 2025 •

edited

Loading

collindutter commented Jan 30, 2025 •

edited

Loading