Skip to content
This repository was archived by the owner on Feb 15, 2025. It is now read-only.

feat:(backend): Implement Context Cancellation #280

Open
1 task
Tracked by #320
gphorvath opened this issue Mar 21, 2024 · 0 comments
Open
1 task
Tracked by #320

feat:(backend): Implement Context Cancellation #280

gphorvath opened this issue Mar 21, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@gphorvath
Copy link

Type: Feature

Description: Cancelling context cancels generation to free resources to retry requests or otherwise stop inference.

User Story:
For each backend (VLLM, Embeddings, Llama-cpp-python, Whisper), implement any changes from the SDK required to perform context cancellation.

Acceptance Criteria:

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants