
Function calling #175

Merged 21 commits into main on May 30, 2024

Conversation

@chadbailey59 (Contributor) commented May 23, 2024

No description provided.

@chadbailey59 chadbailey59 force-pushed the cb/function-calling branch from 3418f19 to 729aca3 Compare May 24, 2024 17:56
@chadbailey59 (Contributor, Author):

So, function calling is... weird.

Without function calling, a chatbot pipeline is pretty straightforward:

  • The user says something
  • Pipecat appends whatever the user said to a messages list and sends that list to the LLM
  • The LLM generates an assistant response
  • Pipecat generates TTS from that assistant response and plays that audio through the transport
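The simple flow above is really just message-list bookkeeping. Here's a standalone sketch with the LLM and TTS calls stubbed out; none of these names are real Pipecat API, the real framework wires these steps together as pipeline processors:

```python
# Standalone model of the no-tools chatbot flow. `fake_llm` and `fake_tts`
# are stand-ins for the real LLM and TTS services.

def fake_llm(messages):
    """Stand-in for an LLM completion call."""
    return {"role": "assistant", "content": f"You said: {messages[-1]['content']}"}

def fake_tts(text):
    """Stand-in for a TTS service; returns 'audio' for the transport."""
    return f"<audio for: {text}>"

def handle_user_turn(messages, user_text):
    # 1. Append whatever the user said to the messages list
    messages.append({"role": "user", "content": user_text})
    # 2. Send the list to the LLM and get an assistant response
    assistant = fake_llm(messages)
    messages.append(assistant)
    # 3. Generate TTS from the assistant response and "play" it
    return fake_tts(assistant["content"])

messages = [{"role": "system", "content": "You are a helpful bot."}]
audio = handle_user_turn(messages, "hello")
```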

With function calling, the flow is different:

  • The user says something
  • Pipecat appends whatever the user said to a messages list (along with some possible "tools" it may choose to use) and sends that list to the LLM
  • The LLM generates an assistant response that may be text, or it may be a "tool call", i.e. the LLM decides to use one of the available "tools"
    • If it's text:
      • Pipecat generates TTS from that assistant response and plays that audio through the transport
    • If it's a tool call:
      • Pipecat needs to append the assistant message with the tool call, including its params, to the message list
      • Pipecat then needs to call the requested function with the provided params (e.g. "check_weather" with params {location: 'san francisco'})
      • Pipecat then needs to append the results from that function call to the messages list in the API's dedicated tool-result message format
      • Pipecat then needs to re-prompt the LLM with the new messages list to generate an answer to the user's question
      • Finally, Pipecat generates TTS from that second assistant response and plays that audio through the transport

Right now, that entire second branch is implemented in the 15-function-calling example as a FunctionCaller class that pushes a context frame back up the pipeline for the re-prompting. We should probably be handling all this inside the framework itself, but that starts to touch on how much context management we should be doing on behalf of the user.
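The tool-call branch can be sketched standalone like this, using OpenAI-style message shapes; `check_weather`, the ids, and `handle_tool_call` are illustrative stand-ins, not the actual FunctionCaller implementation:

```python
# Standalone model of the tool-call branch: append the tool call, run the
# function, append the result in the API's tool-result message format.
import json

def check_weather(location):
    return f"Sunny in {location}"

TOOLS = {"check_weather": check_weather}

def handle_tool_call(messages, tool_call):
    # Append the assistant message containing the tool call (with params)
    messages.append({"role": "assistant", "tool_calls": [tool_call]})
    # Call the requested function with the provided params
    fn = TOOLS[tool_call["function"]["name"]]
    args = json.loads(tool_call["function"]["arguments"])
    result = fn(**args)
    # Append the result in the tool-result format the API expects
    messages.append({
        "role": "tool",
        "tool_call_id": tool_call["id"],
        "content": result,
    })
    # Re-prompting the LLM with `messages` would happen here

messages = []
call = {
    "id": "call_1",
    "type": "function",
    "function": {"name": "check_weather",
                 "arguments": json.dumps({"location": "san francisco"})},
}
handle_tool_call(messages, call)
```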

llm, # LLM
tts, # TTS
transport.output(), # Transport bot output
tma_out # Santa Cat spoken responses
Contributor:

Were all the comments removed? Also, we should use WakeCheckFilter instead.

]

})
self._context.add_message(tool_call)
Contributor:
I believe this can be added internally when we push LLMFunctionCallFrame. There's nothing the user should have to do here.

Contributor (Author):

I'm hesitant to do this on behalf of the user, because as a general rule we don't mess with the context in the framework. In the patient-intake example, I'm kind of misusing function calling and not actually inserting function call results into the context, for example.

Contributor:

I see. Well, in this case it's not the result, it's just the function call itself. Maybe for this one we could add an argument to the LLM service, something like include_function_calling_in_context (defaulting to True). Asking the user to handle all this is probably too much?
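The suggestion above could look something like this sketch; `include_function_calling_in_context` comes from the comment, but `LLMServiceSketch`, `push_function_call`, and the list-based context are all made up for illustration:

```python
# Hypothetical sketch: the LLM service records the tool call in the context
# only when the flag is on, so users can opt out of automatic context edits.
class LLMServiceSketch:
    def __init__(self, context, include_function_calling_in_context=True):
        self._context = context
        self._include = include_function_calling_in_context

    def push_function_call(self, tool_call):
        # Record the call in the context when the flag is on; an
        # LLMFunctionCallFrame would be pushed downstream either way.
        if self._include:
            self._context.append(tool_call)

ctx_on, ctx_off = [], []
LLMServiceSketch(ctx_on).push_function_call({"name": "check_weather"})
LLMServiceSketch(ctx_off, include_function_calling_in_context=False).push_function_call({"name": "check_weather"})
```

The default of True matches the comment's suggestion: most users get the expected behavior, and cases like the patient-intake example can opt out.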

@chadbailey59 (Contributor, Author):

@aconchillo I think I've addressed the concerns and the function calling code is ready to merge. But I'm still concerned I may have inadvertently undone some of your changes through various merges and rebases.

@chadbailey59 chadbailey59 requested a review from aconchillo May 28, 2024 17:07
LLMFullResponseEndFrame,
LLMFullResponseStartFrame,
LLMFunctionStartFrame,
Contributor:

Remove LLMFunctionStartFrame and CallFrame

@chadbailey59 (Contributor, Author):

OK, I think I've gotten the new function calling approach where it needs to be. Let's get this one merged, and I can remove the function call frame types in a follow-up PR. @aconchillo

@chadbailey59 chadbailey59 requested a review from aconchillo May 30, 2024 14:47
self._context.add_message(
{"role": "system", "content": "Finally, ask the user the reason for their doctor visit today. Once they answer, call the list_visit_reasons function."})
await llm.process_frame(OpenAILLMContextFrame(self._context), FrameDirection.DOWNSTREAM)
pass
Contributor:

Should all these functions return None?

Contributor:

They do implicitly, but it may be better to be explicit about it.
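The point is small but worth illustrating; `Ctx` here is a stand-in for the real context object, not Pipecat's class:

```python
# Implicit vs. explicit None returns in handlers like the one quoted above.
class Ctx:
    def __init__(self):
        self.messages = []
    def add_message(self, m):
        self.messages.append(m)

def implicit_style(context):
    context.add_message({"role": "system", "content": "..."})
    # falls off the end, so Python returns None implicitly

def explicit_style(context):
    context.add_message({"role": "system", "content": "..."})
    return None  # explicit: this handler intentionally produces no value
```

Both behave identically; the explicit `return None` just documents that the handler is side-effect-only.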

@chadbailey59 chadbailey59 force-pushed the cb/function-calling branch from f09a9ba to ed70c71 Compare May 30, 2024 16:48
@chadbailey59 chadbailey59 merged commit 4c3d19c into main May 30, 2024
2 of 3 checks passed
@chadbailey59 chadbailey59 deleted the cb/function-calling branch May 30, 2024 17:25
@aconchillo aconchillo mentioned this pull request Jun 6, 2024