Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

deepseek qwen 14b 响应超时时间 #2882

Open
1 of 3 tasks
SeesawLiu opened this issue Feb 18, 2025 · 3 comments
Open
1 of 3 tasks

deepseek qwen 14b 响应超时时间 #2882

SeesawLiu opened this issue Feb 18, 2025 · 3 comments
Milestone

Comments

@SeesawLiu
Copy link

SeesawLiu commented Feb 18, 2025

System Info / 系統信息

inference 12.2 容器版本

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?

  • docker / docker
  • pip install / 通过 pip install 安装
  • installation from source / 从源码安装

Version info / 版本信息

12.2

The command used to start Xinference / 用以启动 xinference 的命令

curl -X 'POST' \ 'http://127.0.0.12:9997/v1/models' \ -H 'accept: application/json' \ -H 'Content-Type: application/json' \ -d '{ "model_engine": "vllm", "model_name": "deepseek-r1-distill-qwen", "model_path": "/app/models/DeepSeek-R1-Distill-Qwen-14B", "worker_ip": "10.0.6.101", "max_model_len": 57744 }'

Reproduction / 复现过程

使用api提问:例如:提问发动机冒烟的原因是什么。响应超过30秒,停止输出。

Expected behavior / 期待表现

完整的输出deepseek的思考

@XprobeBot XprobeBot added this to the v1.x milestone Feb 18, 2025
@jackleeforce
Copy link

same with me.

@qinxuye
Copy link
Contributor

qinxuye commented Feb 19, 2025

是分布式?

@SeesawLiu
Copy link
Author

分布式部署,不过deepseek运行的服务器只有一张显卡

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants