
Support structured output in ChatDatabricks #28

Merged · 1 commit merged on Oct 18, 2024

Conversation

@B-Step62 (Collaborator) commented on Oct 17, 2024

Resolve #19

Implements the with_structured_output() method in ChatDatabricks to support a structured output feature compatible with OpenAI's. The code mostly copies the ChatOpenAI implementation, with slight adjustments, e.g. FMAPI does not support the parallel_tool_calls parameter.
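For illustration, the request shape this kind of structured-output binding produces can be sketched in plain Python. This is a hypothetical sketch, not the actual ChatDatabricks code; the helper name `build_structured_output_request` and the schema are invented for the example, assuming an OpenAI-style tool-calling API:

```python
# Sketch of how a structured-output request might be assembled, assuming an
# OpenAI-compatible tool-calling API. Not the actual ChatDatabricks code.
def build_structured_output_request(schema_name: str, json_schema: dict) -> dict:
    """Build request kwargs that force the model to call the formatting tool."""
    tool = {
        "type": "function",
        "function": {"name": schema_name, "parameters": json_schema},
    }
    return {
        "tools": [tool],
        # Forcing tool_choice to this single tool guarantees at most one
        # tool call in the response. Note there is no parallel_tool_calls
        # key here, since FMAPI does not support that parameter.
        "tool_choice": {"type": "function", "function": {"name": schema_name}},
    }

req = build_structured_output_request(
    "AnswerWithJustification",
    {"type": "object", "properties": {"answer": {"type": "string"}}},
)
```

Because `tool_choice` names exactly one tool, the model has no way to return calls to any other tool, which is what makes the single-tool assumptions later in the thread safe.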

Test

  • Unit test
  • Integration test via local machine.
  • Tested on Databricks:

[Screenshot: tested on Databricks, 2024-10-17]

@B-Step62 (Collaborator, Author) commented:

@harupy @BenWilson2 Could you review this change, which mostly copies what OAI does? I'd like to close the FR before releasing the next version by the end of this week.

```python
    )
else:
    output_parser = JsonOutputKeyToolsParser(
        key_name=tool_name, first_tool_only=True
    )
```

Collaborator commented on this excerpt:

In the OAI implementation, they're doing the same thing?
IIRC their tool requests are sequential, but it will be good to double check.

@B-Step62 (Collaborator, Author) replied on Oct 18, 2024:

Yes https://github.com/langchain-ai/langchain/blob/4fab8996cf3a5a34bd5333c6848b0bccf798a6a0/libs/partners/openai/langchain_openai/chat_models/base.py#L1234-L1236

I think the rationale is that it is guaranteed that only one tool matches the tool_name we pass here.
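The `first_tool_only=True` behavior under discussion can be illustrated with a small stand-in. This is not the actual `JsonOutputKeyToolsParser` implementation from langchain, just a minimal sketch of the matching logic being relied on:

```python
# Illustrative stand-in for JsonOutputKeyToolsParser(key_name=..., first_tool_only=True);
# not the real langchain implementation, only the behavior being discussed.
def parse_first_tool_call(tool_calls: list[dict], key_name: str):
    """Return the args of the first tool call whose name matches key_name."""
    for call in tool_calls:
        if call.get("name") == key_name:
            return call.get("args")
    return None

calls = [
    {"name": "AnswerWithJustification", "args": {"answer": "They weigh the same"}},
]
result = parse_first_tool_call(calls, "AnswerWithJustification")
# result == {'answer': 'They weigh the same'}
```

Since the structured-output path constrains the model to a single formatting tool, taking only the first match cannot silently drop other meaningful tool calls.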

Collaborator replied:

SGTM! Let's merge :)

@BenWilson2 (Collaborator) left a comment:

overall LGTM - let's verify that the tool response requests from OpenAI are always provided as sequential iterations (I BELIEVE that is how their APIs work, but let's be sure we're not inadvertently dropping multiple tool requests; for context, Anthropic adheres to a similar response structure but their ToolRequestFormat is a collection instead of sequential calls)

@B-Step62 (Collaborator, Author) commented on Oct 18, 2024:

Reading through the code closely, the more direct reason is that the LLM can only return a single tool call for structured output, because we (and OAI/Anthropic) set the tool_choice parameter to include only the formatting tool 🙂

https://github.com/langchain-ai/langchain/blob/4fab8996cf3a5a34bd5333c6848b0bccf798a6a0/libs/partners/openai/langchain_openai/chat_models/base.py#L1226

It's good to understand the rationale behind it (rather than copy-pasting without thought). Great callout, Ben!

@B-Step62 B-Step62 merged commit ff1a60b into langchain-ai:main Oct 18, 2024
8 checks passed
@B-Step62 B-Step62 deleted the structured-output branch October 18, 2024 02:06
Successfully merging this pull request may close these issues.

Add structured_output method support