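System Info
Ubuntu 20.04 host, Docker image v2.3.1
Reproduction
Launch the Docker image with PREFIX_CACHING=0 (or false). Prefix caching is still enabled.
Expected behavior
The PREFIX_CACHING environment variable should be respected: setting it to 0 or false should disable prefix caching.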
This also prevents you from using ATTENTION=paged: because prefix caching is always enabled, the model shards crash on launch.
To disable prefix caching in v2.3.1, you have to set both USE_PREFIX_CACHING=0 and PREFIX_CACHING=0.
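For anyone who wants to copy the workaround, here is a minimal sketch of passing both variables to the container. The image reference `your-image:your-tag` is a placeholder (the thread only mentions v2.3.1, not the full image name), and the command is printed rather than executed so you can check it first.

```shell
# Placeholder image reference: replace with the image you actually launch.
IMAGE="${IMAGE:-your-image:your-tag}"

# Workaround reported above for v2.3.1: set BOTH variables to 0.
ENV_FLAGS="-e USE_PREFIX_CACHING=0 -e PREFIX_CACHING=0"

# Build the command and print it for inspection; run it yourself once it looks right.
CMD="docker run $ENV_FLAGS $IMAGE"
echo "$CMD"
```

If the goal is ATTENTION=paged, adding `-e ATTENTION=paged` to `ENV_FLAGS` would be the natural next step, though that combination has not been confirmed in this thread.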