Adding End-Of-Generation-Token parameter for text generation Inference API #376

aryananumula · 2024-01-02T00:38:53Z

Is your feature request related to a problem? Please describe.
While using the Inference API for a chatbot-style text-generation model, such as openchat-3.5, it is not possible to set an end of generation token.

Describe the solution you'd like
Addition of the end_of_generation_token parameter to the Inference API for text generation models.

Describe alternatives you've considered
Setting max_new_tokens to 1, and then generating new tokens and looking for a certain token to stop at.

Additional context
There is no additional context for this request.

The text was updated successfully, but these errors were encountered:

MichaelVandi · 2024-05-20T15:33:31Z

You can do something like this

{
    "inputs": "What is Deep Learning?",
    "parameters": {
        "max_new_tokens": 300,
        "stop": ["<|end_of_text|>", "<|endoftext|>", "}"]
    }
}

Where parameters.stop is an array of eos tokens

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding End-Of-Generation-Token parameter for text generation Inference API #376

Adding End-Of-Generation-Token parameter for text generation Inference API #376

aryananumula commented Jan 2, 2024

MichaelVandi commented May 20, 2024

Adding End-Of-Generation-Token parameter for text generation Inference API #376

Adding End-Of-Generation-Token parameter for text generation Inference API #376

Comments

aryananumula commented Jan 2, 2024

MichaelVandi commented May 20, 2024