You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
xinference的配置如下
部署时,报错如下:
Server error: 503 - [address=0.0.0.0:43789, pid=359] CUDA error: no kernel image is available for execution on the device CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1 Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
当前显卡为:
Expected behavior / 期待表现
可以正常部署qwen2-instruct
The text was updated successfully, but these errors were encountered:
System Info / 系統信息
使用docker启动xinference-local,使用的镜像是xprobe/xinference:v1.0.0
Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?
Version info / 版本信息
v1.0.0
The command used to start Xinference / 用以启动 xinference 的命令
/usr/local/bin/xinference-local --host 0.0.0.0 --port 9997
Reproduction / 复现过程
xinference的配置如下
部署时,报错如下:
Server error: 503 - [address=0.0.0.0:43789, pid=359] CUDA error: no kernel image is available for execution on the device CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. For debugging consider passing CUDA_LAUNCH_BLOCKING=1 Compile with
TORCH_USE_CUDA_DSA
to enable device-side assertions.当前显卡为:
Expected behavior / 期待表现
可以正常部署qwen2-instruct
The text was updated successfully, but these errors were encountered: