Support continue final message #2733
base: main
Conversation
disable_grammar_support=False,
use_flash_attention=False,
Remove those.
Removed in the latest commit.
"max_tokens": 30,
"stream": False,
"seed": 1337,
"continue_final_message": False,
Why can't we assume that a final assistant message means the user wants a continuation?
There's no use case where the LLM produces a user output, right?
That's an interesting idea. Currently, a conversation could hypothetically contain back-to-back assistant messages (assuming the template allows it), i.e.
<system>
<user>
<assistant>
<assistant>
I could see this pattern being used when injecting some information. For example, a request may contain messages like:
"messages": [
{"role": "system", "content": "system message"},
{"role": "user", "content": "what's ticket 10 about?"},
{"role": "assistant", "content": "Ticket Information:\nTicket ID: 10\nTicket Title: Ticket 10\nTicket Description: This is a test ticket\nTicket Status: Open\nTicket Priority: High\nTicket Type: Bug\nTicket Assignee: John Doe\nTicket Reporter: Jane Doe\nTicket Created: 2022-01-01 00:00:00\nTicket Updated: 2022-01-01 00:00:00\n"},
],
and the user may expect a new message rather than a continuation of the last one.
However, it would be nice to disallow this pattern to avoid the extra argument, and always continue the last message if it's from the assistant (or use some other heuristic).
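That heuristic can be sketched in a few lines. This is a hypothetical illustration of the "continue when the final message is from the assistant" rule under discussion, not the actual server code:

```python
# Sketch of the proposed heuristic (not the actual implementation): generation
# continues the last message whenever that message is from the assistant,
# instead of starting a new turn.

def should_continue_final_message(messages):
    """Return True when generation should extend the final assistant message."""
    return bool(messages) and messages[-1]["role"] == "assistant"

messages = [
    {"role": "system", "content": "system message"},
    {"role": "user", "content": "what's ticket 10 about?"},
    {"role": "assistant", "content": "Ticket Information:\n"},
]
print(should_continue_final_message(messages))  # True
```

With this rule, the back-to-back-assistant pattern above would always be treated as a continuation, which is exactly the trade-off being discussed.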
Does the pattern exist in the wild or not? If there's no evidence, let's not guess about what might be, and focus instead on what is.
We can add a flag when the current behavior proposal is no longer the only one (for instance, when there's a new use case). Until then, let's aim for simplicity.
Great point! I've removed continue_final_message and instead assume that if the final message is from the assistant, we will continue it.
This PR adds the continue_final_message param to chat requests, allowing users to continue the last message. This is useful when prefilling a response from the model. See https://huggingface.co/docs/transformers/main/en/chat_templating#what-does-continuefinalmessage-do for details.