
Remove markdown code block markers from LLM's response #1641

Closed
wants to merge 5 commits

Conversation

ddragosd

@roribio and I debugged today why it takes so long (60s+) for a task to complete when using output_pydantic.

It turns out that it was caused by gpt-4o insisting on wrapping the JSON in markdown code block markers when returning it.

We traced the problem to the Converter throwing a JSONDecodeError and falling back to handle_partial_json, which, for a simple JSON payload (120 lines, 7KB), takes 60s+ to complete.
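For context, the failure mode looks roughly like this; strip_code_fences is a hypothetical helper sketching the kind of fence stripping this PR proposes, not the actual patch:

```python
import json
import re

def strip_code_fences(text: str) -> str:
    # Hypothetical helper: remove the leading/trailing markdown code fences
    # (e.g. ```json ... ```) that gpt-4o often wraps around JSON output.
    match = re.search(r"```(?:json)?\s*(.*?)\s*```", text, re.DOTALL)
    return match.group(1) if match else text

raw = '```json\n{"title": "demo", "summary": "ok"}\n```'

# Without stripping, json.loads raises JSONDecodeError and the Converter
# falls back to the slow handle_partial_json path.
data = json.loads(strip_code_fences(raw))
print(data)  # {'title': 'demo', 'summary': 'ok'}
```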

@bhancockio
Collaborator

Hey @ddragosd !

Thank you for creating this PR!

Root issue:

  • The models defined in output_pydantic and output_json are not passed into the LLM calls. As a result, the output of a task does not match the specified output format, which forces us to make another LLM call just to reformat the original output.

Fix:

  • When output_pydantic or output_json is present, append a stringified version of the schema to the prompt (see the sketch below).
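A minimal sketch of that idea, assuming a Pydantic v2 model and a hypothetical build_prompt helper (the actual change landed in #1651):

```python
import json
from pydantic import BaseModel

class TaskOutput(BaseModel):
    # Example of a model a user might pass as output_pydantic.
    title: str
    summary: str

def build_prompt(base_prompt: str, model: type[BaseModel]) -> str:
    # Hypothetical helper: append the stringified JSON schema so the first
    # LLM call already returns output in the expected structure.
    schema = json.dumps(model.model_json_schema(), indent=2)
    return (
        f"{base_prompt}\n\n"
        "Return only valid JSON matching this schema, with no markdown "
        "code fences:\n"
        f"{schema}"
    )

print(build_prompt("Summarize the article.", TaskOutput))
```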

Improvement stats:

  • When running a simple crew with 1 task and an output_pydantic, our old approach took around 30 seconds to complete the crew kickoff.
  • Half of this time was spent on a follow-up LLM call to convert the task's output into the specified Pydantic model.
  • Now the task output is generated in the correct format on the first call, so the kickoff only takes about 15 seconds, a 50% reduction in runtime.
  • We also make one fewer LLM call, which saves users money as well.

Closed by #1651

bhancockio closed this Nov 26, 2024
@ddragosd
Author

@bhancockio, I'm glad you caught a deeper issue with crewAI making two LLM calls instead of one, and improved the response time overall. This is awesome. I'm testing it right now. Thanks so much for the quick fix!

Meanwhile, what do you think about support for [structured outputs](https://openai.com/index/introducing-structured-outputs-in-the-api/)? I understand not all models have it today, but would you have any recommendations for how we could leverage this right now? I was going to look for a way to support this, and may even work on a PR. Thanks!
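For reference, a sketch of what structured outputs look like with the OpenAI Python SDK (>= 1.40) and a model that supports them, e.g. gpt-4o-2024-08-06. This is one possible direction, not part of this PR:

```python
from openai import OpenAI
from pydantic import BaseModel

class TaskOutput(BaseModel):
    title: str
    summary: str

client = OpenAI()

# The SDK converts the Pydantic model into a strict JSON schema, and the API
# guarantees the response conforms to it -- no fence stripping or retry needed.
completion = client.beta.chat.completions.parse(
    model="gpt-4o-2024-08-06",
    messages=[{"role": "user", "content": "Summarize the article."}],
    response_format=TaskOutput,
)

result: TaskOutput = completion.choices[0].message.parsed
print(result)
```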
