Error: "POST /generate HTTP/1.1" 404 Not Found when running Locust tool against vLLM model server #777

Open
Edwinhr716 opened this issue Aug 14, 2024 · 0 comments

Whenever I run the Locust benchmarking tool against a vLLM model server, I get this error:

"POST /generate HTTP/1.1" 404 Not Found

After some investigation, it looks like this was caused by #694, which switched the entrypoint from vllm.entrypoints.api_server to vllm.entrypoints.openai.api_server. The OpenAI entrypoint does not have a /generate endpoint, which breaks the Locust benchmarking tool.

It looks like this was called out in #694 (comment) but never addressed.
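
To illustrate the mismatch, here is a minimal sketch of how a client would have to pick its request path depending on which entrypoint the server was started with. It assumes the OpenAI-compatible server is reached at /v1/completions and needs a "model" field in the body. The helper name build_request and the model name "my-model" are hypothetical, and this is not the benchmarking tool's actual code.

```python
import json

LEGACY_ENTRYPOINT = "vllm.entrypoints.api_server"
OPENAI_ENTRYPOINT = "vllm.entrypoints.openai.api_server"


def build_request(entrypoint, prompt, max_tokens, model="my-model"):
    """Return (path, JSON body) for the given vLLM entrypoint.

    Hypothetical helper for illustration; "my-model" is a placeholder
    and must match whatever model the server is actually serving.
    """
    if entrypoint == LEGACY_ENTRYPOINT:
        # The legacy server exposes /generate and takes the prompt directly.
        return "/generate", {"prompt": prompt, "max_tokens": max_tokens}
    if entrypoint == OPENAI_ENTRYPOINT:
        # The OpenAI-compatible server has no /generate route; completions
        # go to /v1/completions and the body names the model.
        return "/v1/completions", {
            "model": model,
            "prompt": prompt,
            "max_tokens": max_tokens,
        }
    raise ValueError(f"unknown entrypoint: {entrypoint}")


if __name__ == "__main__":
    for ep in (LEGACY_ENTRYPOINT, OPENAI_ENTRYPOINT):
        path, body = build_request(ep, "Hello", 16)
        print("POST", path, json.dumps(body))
```

A load test that always posts to /generate matches only the first branch, which would explain the 404 once the server runs the OpenAI entrypoint. Note that the response shapes of the two endpoints also differ, so any response parsing in the test would need a matching change.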
