Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

xinference v1.0.0无法正常启动qwen2-instruct #2564

Open
1 of 3 tasks
m199369309 opened this issue Nov 21, 2024 · 0 comments
Open
1 of 3 tasks

xinference v1.0.0无法正常启动qwen2-instruct #2564

m199369309 opened this issue Nov 21, 2024 · 0 comments
Labels
Milestone

Comments

@m199369309
Copy link

m199369309 commented Nov 21, 2024

System Info / 系統信息

使用docker启动xinference-local,使用的镜像是xprobe/xinference:v1.0.0

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?

  • docker / docker
  • pip install / 通过 pip install 安装
  • installation from source / 从源码安装

Version info / 版本信息

v1.0.0

The command used to start Xinference / 用以启动 xinference 的命令

/usr/local/bin/xinference-local --host 0.0.0.0 --port 9997

Reproduction / 复现过程

xinference的配置如下
image
部署时,报错如下:
image
Server error: 503 - [address=0.0.0.0:43789, pid=359] CUDA error: no kernel image is available for execution on the device CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1 Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
当前显卡为:
image

Expected behavior / 期待表现

可以正常部署qwen2-instruct

@XprobeBot XprobeBot added the gpu label Nov 21, 2024
@XprobeBot XprobeBot added this to the v0.16 milestone Nov 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants