Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Query : What is the intention behind the specified max_new_tokens/max_tokens in the video understanding notebooks ? #83

Open
PHOENIXFURY007 opened this issue Nov 27, 2024 · 1 comment

Comments

@PHOENIXFURY007
Copy link

04_video_understanding.ipynb
max_new_tokens=2048

04_video_understanding_vllm.ipynb
max_tokens=4096

Does the token size here refer to the total context window? Or does it refer to the context window for the chat reply?
It was mentioned in the introduction that the Long multimodal context window consists of 64K tokens. So can you elaborate on it a bit? Thanks

@PHOENIXFURY007 PHOENIXFURY007 changed the title Query : Whta is the intention behind the specified max_new_tokens/max_tokens in the video understanding notebooks ? Query : What is the intention behind the specified max_new_tokens/max_tokens in the video understanding notebooks ? Nov 27, 2024
@xffxff
Copy link
Collaborator

xffxff commented Nov 28, 2024

04_video_understanding.ipynb max_new_tokens=2048

04_video_understanding_vllm.ipynb max_tokens=4096

Does the token size here refer to the total context window? Or does it refer to the context window for the chat reply? It was mentioned in the introduction that the Long multimodal context window consists of 64K tokens. So can you elaborate on it a bit? Thanks

max_tokens and max_new_tokens do not refer to the total context window size. Instead, they specify the maximum number of tokens that the model can generate in a single output sequence. These values must be less than the total context window size of 64k tokens. You can set them to larger values if you have enough GPU memory available.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants