Same for me.
Is this a distributed deployment?
Yes, it's a distributed deployment, but the server running DeepSeek only has one GPU.
System Info / 系統信息
Xinference 12.2, container (Docker) version
Running Xinference with Docker? / 是否使用 Docker 运行 Xinference?
Version info / 版本信息
12.2
The command used to start Xinference / 用以启动 xinference 的命令
curl -X 'POST' \
  'http://127.0.0.12:9997/v1/models' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
    "model_engine": "vllm",
    "model_name": "deepseek-r1-distill-qwen",
    "model_path": "/app/models/DeepSeek-R1-Distill-Qwen-14B",
    "worker_ip": "10.0.6.101",
    "max_model_len": 57744
  }'
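For reference, the same launch request can be issued from Python using only the standard library. This is a sketch, not an official client; the endpoint and all parameters mirror the curl command above.

```python
import json
import urllib.request

# Same endpoint as the curl command above.
API_URL = "http://127.0.0.12:9997/v1/models"

# Launch payload, identical to the curl -d body above.
launch_payload = {
    "model_engine": "vllm",
    "model_name": "deepseek-r1-distill-qwen",
    "model_path": "/app/models/DeepSeek-R1-Distill-Qwen-14B",
    "worker_ip": "10.0.6.101",
    "max_model_len": 57744,
}

def launch_model() -> str:
    """POST the launch request and return the server's raw response body."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(launch_payload).encode("utf-8"),
        headers={"accept": "application/json",
                 "Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return resp.read().decode("utf-8")

# Uncomment to launch against a live cluster:
# print(launch_model())
```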
Reproduction / 复现过程
Ask a question via the API, e.g. "What causes an engine to smoke?". After the response has been streaming for more than 30 seconds, output stops mid-answer.
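To pin down where the stream stops, the reproduction can be scripted against Xinference's OpenAI-compatible chat endpoint. This is a sketch under assumptions: the host/port and model name are taken from the launch command above, and the client timeout is set deliberately high to rule out a client-side cutoff. The timing log shows when the server stops sending tokens.

```python
import json
import time
import urllib.request

# Assumed OpenAI-compatible chat endpoint on the same host/port as above.
API_URL = "http://127.0.0.12:9997/v1/chat/completions"

def parse_sse_line(line: str):
    """Extract the delta text from one OpenAI-style SSE line, or None."""
    if not line.startswith("data: "):
        return None
    payload = line[len("data: "):]
    if payload.strip() == "[DONE]":
        return None
    chunk = json.loads(payload)
    return chunk["choices"][0].get("delta", {}).get("content") or ""

def stream_question(question: str) -> None:
    """Stream a completion and log how long tokens keep arriving."""
    body = json.dumps({
        "model": "deepseek-r1-distill-qwen",  # model name from the launch command
        "messages": [{"role": "user", "content": question}],
        "stream": True,
    }).encode("utf-8")
    req = urllib.request.Request(
        API_URL, data=body,
        headers={"Content-Type": "application/json"},
    )
    start = time.monotonic()
    # Generous timeout so any ~30 s cutoff cannot come from this client.
    with urllib.request.urlopen(req, timeout=300) as resp:
        for raw in resp:
            text = parse_sse_line(raw.decode("utf-8").strip())
            if text:
                print(f"[{time.monotonic() - start:6.1f}s] {text}",
                      end="", flush=True)

# Uncomment to run against a live deployment:
# stream_question("What causes an engine to smoke?")
```

If the timestamps show tokens stopping at roughly 30 seconds regardless of the question, the cutoff is likely a server- or proxy-side timeout rather than the model finishing its answer.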
Expected behavior / 期待表现
The full DeepSeek reasoning ("thinking") output should be returned without being cut off.