Hello,

I noticed that the following code only produces 16 completion tokens:
```python
from langchain_nvidia_ai_endpoints.chat_models import ChatNVIDIA

llm = ChatNVIDIA(model="ai-mixtral-8x7b-instruct")
llm.invoke("Tell me a story about langchain")
```

```
ChatMessage(content=' Once upon a time, in a world not too different from ours, there was', response_metadata={'token_usage': {'prompt_tokens': 16, 'total_tokens': 32, 'completion_tokens': 16}, 'model_name': 'ai-mixtral-8x7b-instruct'}, role='assistant')
```
The same happens with other models (I tried ai-llama3-8b and ai-gemma-7b, for instance).
The default max_tokens value of ChatNVIDIA is None, not 16. If we don't provide max_tokens, it would be much more natural and less confusing if the model used its maximum context length (e.g. 32k for ai-mixtral-8x7b-instruct).
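In the meantime, passing max_tokens explicitly when constructing the client avoids the truncation. A minimal sketch (the value 1024 is just an arbitrary example, not a recommended setting):

```python
from langchain_nvidia_ai_endpoints.chat_models import ChatNVIDIA

# Workaround: set max_tokens explicitly so the silent 16-token
# default is not applied (1024 is an arbitrary example value).
llm = ChatNVIDIA(model="ai-mixtral-8x7b-instruct", max_tokens=1024)
llm.invoke("Tell me a story about langchain")
```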
Thanks!
SimJeg changed the title from *ChatNVIDIA silently set max_tokens to 16* to *ChatNVIDIA silently sets max_tokens to 16* on Apr 23, 2024.
I have the same experience, except with the "old" models whose names carry the "playground_" prefix, where max_tokens is definitely larger than 16. So I don't think this is necessarily a problem with ChatNVIDIA itself, but rather with the defaults of the models behind the endpoints.
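For what it's worth, a quick way to compare the two families is to check completion_tokens in the response metadata (the playground_ alias below is an assumption; substitute whichever older model name you have access to):

```python
from langchain_nvidia_ai_endpoints.chat_models import ChatNVIDIA

# Hypothetical comparison: "playground_mixtral_8x7b" is an assumed
# alias for the older endpoint naming scheme.
for name in ("ai-mixtral-8x7b-instruct", "playground_mixtral_8x7b"):
    msg = ChatNVIDIA(model=name).invoke("Tell me a story about langchain")
    # completion_tokens reveals whether the 16-token cap was applied
    print(name, msg.response_metadata["token_usage"]["completion_tokens"])
```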