Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

deepseek-r1-qwen xinference 发布后,在测试界面报错 #2885

Open
1 of 3 tasks
SharkSyl opened this issue Feb 19, 2025 · 2 comments
Open
1 of 3 tasks

deepseek-r1-qwen xinference 发布后,在测试界面报错 #2885

SharkSyl opened this issue Feb 19, 2025 · 2 comments
Labels
Milestone

Comments

@SharkSyl
Copy link

SharkSyl commented Feb 19, 2025

System Info / 系統信息

xinference:v1.2.2-cpu

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?

  • docker / docker
  • pip install / 通过 pip install 安装
  • installation from source / 从源码安装

Version info / 版本信息

xinference:v1.2.2-cpu,在容器内部通过 pip list获得的信息

accelerate                     1.3.0
aiofiles                       23.2.1
aiohappyeyeballs               2.4.6
aiohttp                        3.11.12
aioprometheus                  23.12.0
aiosignal                      1.3.2
aliyun-python-sdk-core         2.16.0
aliyun-python-sdk-kms          2.16.5
altair                         5.5.0
annotated-types                0.7.0
antlr4-python3-runtime         4.9.3
anyascii                       0.3.2
anyio                          4.8.0
archspec                       0.2.1
argcomplete                    3.5.3
async-timeout                  5.0.1
attrs                          25.1.0
audioread                      3.0.1
auto_gptq                      0.7.1
autoawq                        0.2.5
autoawq_kernels                0.0.9
av                             14.1.0
babel                          2.17.0
bcrypt                         4.2.1
beautifulsoup4                 4.13.3
bibtexparser                   2.0.0b8
bitsandbytes                   0.45.2
black                          25.1.0
blis                           1.2.0
boltons                        23.0.0
boto3                          1.28.64
botocore                       1.31.85
Brotli                         1.0.9
cached_path                    1.6.7
cachetools                     5.5.1
catalogue                      2.0.10
cbor                           1.0.0
cdifflib                       1.2.9
certifi                        2023.7.22
cffi                           1.15.1
charset-normalizer             2.0.4
chattts                        0.2.2
click                          8.1.8
clldutils                      3.24.1
cloudpathlib                   0.20.0
cloudpickle                    3.1.1
cn2an                          0.5.23
colorama                       0.4.6
coloredlogs                    15.0.1
colorlog                       6.9.0
conda                          23.10.0
conda-content-trust            0.2.0
conda-libmamba-solver          23.11.1
conda-package-handling         2.2.0
conda_package_streaming        0.9.0
confection                     0.1.5
conformer                      0.3.2
contourpy                      1.3.1
controlnet-aux                 0.0.9
crcmod                         1.7
cryptography                   41.0.3
csvw                           3.5.1
curated-tokenizers             0.0.9
curated-transformers           0.1.1
cycler                         0.12.1
cymem                          2.0.11
Cython                         3.0.11
datamodel-code-generator       0.27.2
datasets                       3.2.0
dateparser                     1.1.8
decorator                      5.1.1
Deprecated                     1.2.18
diffusers                      0.32.2
dill                           0.3.8
diskcache                      5.6.3
Distance                       0.1.3
distro                         1.9.0
dlinfo                         2.0.0
docopt                         0.6.2
ecdsa                          0.19.0
editdistance                   0.8.1
einops                         0.8.0
einx                           0.3.0
encodec                        0.1.1
espeakng-loader                0.2.4
eva-decord                     0.6.1
fastapi                        0.115.8
ffmpy                          0.5.0
filelock                       3.13.1
FlagEmbedding                  1.3.4
flatbuffers                    25.1.24
fonttools                      4.56.0
frozendict                     2.4.6
frozenlist                     1.5.0
fsspec                         2024.6.1
fugashi                        1.4.0
funasr                         1.1.16
fvcore                         0.1.5.post20221221
g2p-en                         2.1.0
gdown                          5.2.0
gekko                          1.2.1
genson                         1.3.0
gguf                           0.14.0
google-api-core                2.24.1
google-auth                    2.38.0
google-cloud-core              2.4.1
google-cloud-storage           2.19.0
google-crc32c                  1.6.0
google-resumable-media         2.7.2
googleapis-common-protos       1.67.0rc1
gradio                         4.26.0
gradio_client                  0.15.1
gruut                          2.4.0
gruut-ipa                      0.13.0
gruut-lang-de                  2.0.1
gruut-lang-en                  2.0.1
gruut-lang-es                  2.0.1
gruut-lang-fr                  2.0.2
h11                            0.14.0
hiredis                        3.1.0
httpcore                       1.0.7
httpx                          0.28.1
huggingface-hub                0.27.1
humanfriendly                  10.0
hydra-core                     1.3.2
HyperPyYAML                    1.2.2
idna                           3.4
ijson                          3.3.0
imageio                        2.37.0
imageio-ffmpeg                 0.6.0
importlib_metadata             8.6.1
importlib_resources            6.5.2
inflect                        5.6.2
inscriptis                     2.5.3
iopath                         0.1.10
ir_datasets                    0.5.9
isodate                        0.7.2
isort                          6.0.0
jaconv                         0.4.0
jamo                           0.4.1
jieba                          0.42.1
Jinja2                         3.1.4
jiter                          0.8.2
jj-pytorchvideo                0.1.5
jmespath                       0.10.0
joblib                         1.4.2
jsonlines                      1.2.0
jsonpatch                      1.32
jsonpointer                    2.1
jsonschema                     4.23.0
jsonschema-specifications      2024.10.1
kaldiio                        2.18.0
kiwisolver                     1.4.8
kokoro                         0.7.12
langcodes                      3.5.0
language_data                  1.3.0
language-tags                  1.2.0
lazy_loader                    0.4
libmambapy                     1.5.3
libnacl                        2.1.0
librosa                        0.10.2.post1
lightning                      2.5.0.post0
lightning-utilities            0.12.0
llama_cpp_python               0.3.7
llvmlite                       0.44.0
loguru                         0.7.3
loralib                        0.1.2
lxml                           5.3.0
lz4                            4.4.3
marisa-trie                    1.2.1
Markdown                       3.7
markdown-it-py                 3.0.0
MarkupSafe                     2.1.5
matplotlib                     3.10.0
mdurl                          0.1.2
mecab-python3                  1.0.10
misaki                         0.7.12
modelscope                     1.22.3
mpmath                         1.3.0
msgpack                        1.1.0
multidict                      6.1.0
multiprocess                   0.70.16
murmurhash                     1.0.12
mypy-extensions                1.0.0
narwhals                       1.25.2
natsort                        8.4.0
nemo_text_processing           1.0.2
networkx                       3.3
nltk                           3.9.1
num2words                      0.5.14
numba                          0.61.0
numpy                          1.26.4
nvidia-ml-py                   12.570.86
omegaconf                      2.3.0
onnxruntime                    1.20.1
onnxruntime-gpu                1.16.0
openai                         1.61.1
opencv-contrib-python-headless 4.11.0.86
opencv-python-headless         4.11.0.86
optimum                        1.24.0
orjson                         3.10.15
ormsgpack                      1.7.0
oss2                           2.19.1
packaging                      23.1
pandas                         2.2.3
parameterized                  0.9.0
passlib                        1.7.4
pathspec                       0.12.1
peft                           0.14.0
phonemizer-fork                3.3.2
pillow                         10.4.0
pip                            25.0
platformdirs                   4.3.6
pluggy                         1.0.0
pooch                          1.8.2
portalocker                    3.1.1
preshed                        3.0.9
proces                         0.1.7
propcache                      0.2.1
proto-plus                     1.26.0
protobuf                       5.29.3
psutil                         6.1.1
pyarrow                        19.0.0
pyasn1                         0.6.1
pyasn1_modules                 0.4.1
pybase16384                    0.3.8
pycosat                        0.6.6
pycparser                      2.21
pycryptodome                   3.21.0
pydantic                       2.10.6
pydantic_core                  2.27.2
pydub                          0.25.1
Pygments                       2.19.1
pykakasi                       2.3.0
pylatexenc                     2.10
pynini                         2.1.5
pynndescent                    0.5.13
pyOpenSSL                      23.2.0
pyparsing                      3.2.1
pypinyin                       0.53.0
PySocks                        1.7.1
python-crfsuite                0.9.11
python-dateutil                2.9.0.post0
python-jose                    3.3.0
python-multipart               0.0.20
pytorch-lightning              2.5.0.post0
pytorch-wpe                    0.0.1
pytz                           2025.1
PyYAML                         6.0.2
quantile-python                1.1
qwen-vl-utils                  0.0.10
rdflib                         7.1.3
redis                          5.2.1
referencing                    0.36.2
regex                          2024.11.6
requests                       2.32.3
rfc3986                        1.5.0
rich                           13.9.4
rouge                          1.0.1
rpds-py                        0.22.3
rsa                            4.9
ruamel.yaml                    0.18.10
ruamel.yaml.clib               0.2.12
ruff                           0.9.5
s3transfer                     0.7.0
sacremoses                     0.1.1
safetensors                    0.5.2
scikit-image                   0.25.1
scikit-learn                   1.6.1
scipy                          1.15.1
segments                       2.2.1
semantic-version               2.10.0
sentence-transformers          3.4.1
sentencepiece                  0.2.0
setproctitle                   1.3.4
setuptools                     68.0.0
shellingham                    1.5.4
silero-vad                     5.1.2
six                            1.17.0
smart-open                     7.1.0
sniffio                        1.3.1
soundfile                      0.13.1
soupsieve                      2.6
soxr                           0.5.0.post1
spacy                          3.8.4
spacy-curated-transformers     0.3.0
spacy-legacy                   3.0.12
spacy-loggers                  1.0.5
srsly                          2.5.1
sse-starlette                  2.2.1
starlette                      0.45.3
sympy                          1.13.1
tabulate                       0.9.0
tblib                          3.0.0
tensorboardX                   2.6.2.2
tensorizer                     2.9.1
termcolor                      2.5.0
thinc                          8.3.4
threadpoolctl                  3.5.0
tifffile                       2025.1.10
tiktoken                       0.8.0
timm                           0.6.7
tokenizers                     0.21.0
tomli                          2.2.1
tomlkit                        0.12.0
torch                          2.6.0+cpu
torch-complex                  0.4.4
torchaudio                     2.6.0+cpu
torchdiffeq                    0.2.5
torchmetrics                   1.6.1
torchvision                    0.21.0+cpu
tqdm                           4.67.1
transformers                   4.48.3
transformers-stream-generator  0.0.5
trec-car-tools                 2.6
truststore                     0.8.0
typer                          0.11.1
typing_extensions              4.12.2
tzdata                         2025.1
tzlocal                        5.2
umap-learn                     0.5.7
unidic-lite                    1.0.8
unlzw3                         0.2.3
uritemplate                    4.1.1
urllib3                        1.26.18
uvicorn                        0.34.0
uvloop                         0.21.0
vector-quantize-pytorch        1.17.3
verovio                        5.0.0
vocos                          0.1.0
warc3-wet                      0.2.5
warc3-wet-clueweb09            0.2.5
wasabi                         1.1.3
weasel                         0.4.1
websockets                     11.0.3
WeTextProcessing               1.0.3
wget                           3.2
wheel                          0.41.2
wrapt                          1.17.2
x-transformers                 2.0.2
xinference                     1.2.2
xoscar                         0.4.6
xxhash                         3.5.0
yacs                           0.1.8
yarl                           1.18.3
zipp                           3.21.0
zlib-state                     0.1.9
zstandard                      0.19.0

The command used to start Xinference / 用以启动 xinference 的命令

docker compose 启动,然后进入容器内部执行命令

xinference launch --model_path /root/.cache/huggingface/hub/models--deepseek-ai--DeepSeek-R1-Distill-Qwen-7B/snapshots/6602cadec947dbb53e64f3d8d6425320b2197247 --model-engine Transformers --model-name deepseek-r1-distill-qwen --size-in-billions 7 --model-format pytorch --quantization none

docker compose 文件如下

services:
  xinference:
    image: dockerhub.kubekey.local/misaigc/xinference:v1.2.2-cpu
    command: xinference-local -H 0.0.0.0
    restart: always
    ports:
      - "9997:9997"
    volumes:
      - ./volumes/.xinference:/root/.xinference
      - ./volumes/.cache/huggingface:/root/.cache/huggingface
      - ./volumes/.cache/modelscope:/root/.cache/modelscope

Reproduction / 复现过程

复现过程:

  1. 通过docker comose up -d启动服务器
  2. 通过docker compose exec 进入容器
  3. 执行命令
xinference launch --model_path /root/.cache/huggingface/hub/models--deepseek-ai--DeepSeek-R1-Distill-Qwen-7B/snapshots/6602cadec947dbb53e64f3d8d6425320b2197247 --model-engine Transformers --model-name deepseek-r1-distill-qwen --size-in-billions 7 --model-format pytorch --quantization none

4.通过web进入测试界面

Image
5. 输入测试指令

Image

这是console中的详细报错信息
xinference-1 | 2025-02-19T03:24:16.934029010Z loading configuration file /root/.xinference/cache/deepseek-r1-distill-qwen-pytorch-7b/config.json
xinference-1 | 2025-02-19T03:24:16.936248465Z 2025-02-19 11:24:16,936 transformers.configuration_utils 895 INFO Model config Qwen2Config {
xinference-1 | 2025-02-19T03:24:16.936267491Z "_name_or_path": "/root/.xinference/cache/deepseek-r1-distill-qwen-pytorch-7b",
xinference-1 | 2025-02-19T03:24:16.936272070Z "architectures": [
xinference-1 | 2025-02-19T03:24:16.936275323Z "Qwen2ForCausalLM"
xinference-1 | 2025-02-19T03:24:16.936278260Z ],
xinference-1 | 2025-02-19T03:24:16.936281439Z "attention_dropout": 0.0,
xinference-1 | 2025-02-19T03:24:16.936284673Z "bos_token_id": 151643,
xinference-1 | 2025-02-19T03:24:16.936287578Z "eos_token_id": 151643,
xinference-1 | 2025-02-19T03:24:16.936290368Z "hidden_act": "silu",
xinference-1 | 2025-02-19T03:24:16.936293466Z "hidden_size": 3584,
xinference-1 | 2025-02-19T03:24:16.936296285Z "initializer_range": 0.02,
xinference-1 | 2025-02-19T03:24:16.936299059Z "intermediate_size": 18944,
xinference-1 | 2025-02-19T03:24:16.936302158Z "max_position_embeddings": 131072,
xinference-1 | 2025-02-19T03:24:16.936305011Z "max_window_layers": 28,
xinference-1 | 2025-02-19T03:24:16.936310855Z "model_type": "qwen2",
xinference-1 | 2025-02-19T03:24:16.936313673Z "num_attention_heads": 28,
xinference-1 | 2025-02-19T03:24:16.936316749Z "num_hidden_layers": 28,
xinference-1 | 2025-02-19T03:24:16.936319647Z "num_key_value_heads": 4,
xinference-1 | 2025-02-19T03:24:16.936322531Z "rms_norm_eps": 1e-06,
xinference-1 | 2025-02-19T03:24:16.936337283Z "rope_scaling": null,
xinference-1 | 2025-02-19T03:24:16.936341011Z "rope_theta": 10000,
xinference-1 | 2025-02-19T03:24:16.936344211Z "sliding_window": null,
xinference-1 | 2025-02-19T03:24:16.936347028Z "tie_word_embeddings": false,
xinference-1 | 2025-02-19T03:24:16.936349794Z "torch_dtype": "float32",
xinference-1 | 2025-02-19T03:24:16.936352577Z "transformers_version": "4.48.3",
xinference-1 | 2025-02-19T03:24:16.936355814Z "use_cache": true,
xinference-1 | 2025-02-19T03:24:16.936359162Z "use_mrope": false,
xinference-1 | 2025-02-19T03:24:16.936362180Z "use_sliding_window": false,
xinference-1 | 2025-02-19T03:24:16.936365215Z "vocab_size": 152064
xinference-1 | 2025-02-19T03:24:16.936368107Z }
xinference-1 | 2025-02-19T03:24:16.936370827Z
xinference-1 | 2025-02-19T03:24:16.936378250Z Model config Qwen2Config {
xinference-1 | 2025-02-19T03:24:16.936381539Z "_name_or_path": "/root/.xinference/cache/deepseek-r1-distill-qwen-pytorch-7b",
xinference-1 | 2025-02-19T03:24:16.936384915Z "architectures": [
xinference-1 | 2025-02-19T03:24:16.936387883Z "Qwen2ForCausalLM"
xinference-1 | 2025-02-19T03:24:16.936390630Z ],
xinference-1 | 2025-02-19T03:24:16.936393667Z "attention_dropout": 0.0,
xinference-1 | 2025-02-19T03:24:16.936396683Z "bos_token_id": 151643,
xinference-1 | 2025-02-19T03:24:16.936399721Z "eos_token_id": 151643,
xinference-1 | 2025-02-19T03:24:16.936402424Z "hidden_act": "silu",
xinference-1 | 2025-02-19T03:24:16.936405536Z "hidden_size": 3584,
xinference-1 | 2025-02-19T03:24:16.936408509Z "initializer_range": 0.02,
xinference-1 | 2025-02-19T03:24:16.936411273Z "intermediate_size": 18944,
xinference-1 | 2025-02-19T03:24:16.936414047Z "max_position_embeddings": 131072,
xinference-1 | 2025-02-19T03:24:16.936417334Z "max_window_layers": 28,
xinference-1 | 2025-02-19T03:24:16.936420220Z "model_type": "qwen2",
xinference-1 | 2025-02-19T03:24:16.936423136Z "num_attention_heads": 28,
xinference-1 | 2025-02-19T03:24:16.936426112Z "num_hidden_layers": 28,
xinference-1 | 2025-02-19T03:24:16.936428984Z "num_key_value_heads": 4,
xinference-1 | 2025-02-19T03:24:16.936432026Z "rms_norm_eps": 1e-06,
xinference-1 | 2025-02-19T03:24:16.936434993Z "rope_scaling": null,
xinference-1 | 2025-02-19T03:24:16.936438205Z "rope_theta": 10000,
xinference-1 | 2025-02-19T03:24:16.936441181Z "sliding_window": null,
xinference-1 | 2025-02-19T03:24:16.936443916Z "tie_word_embeddings": false,
xinference-1 | 2025-02-19T03:24:16.936446708Z "torch_dtype": "float32",
xinference-1 | 2025-02-19T03:24:16.936449521Z "transformers_version": "4.48.3",
xinference-1 | 2025-02-19T03:24:16.936452528Z "use_cache": true,
xinference-1 | 2025-02-19T03:24:16.936478069Z "use_mrope": false,
xinference-1 | 2025-02-19T03:24:16.936524542Z "use_sliding_window": false,
xinference-1 | 2025-02-19T03:24:16.936575880Z "vocab_size": 152064
xinference-1 | 2025-02-19T03:24:16.936606938Z }
xinference-1 | 2025-02-19T03:24:16.936620058Z
xinference-1 | 2025-02-19T03:24:17.263300773Z 2025-02-19 11:24:17,263 transformers.modeling_utils 895 INFO loading weights file /root/.xinference/cache/deepseek-r1-distill-qwen-pytorch-7b/model.safetensors.index.json
xinference-1 | 2025-02-19T03:24:17.263335648Z loading weights file /root/.xinference/cache/deepseek-r1-distill-qwen-pytorch-7b/model.safetensors.index.json
xinference-1 | 2025-02-19T03:24:17.263692207Z 2025-02-19 11:24:17,263 transformers.modeling_utils 895 INFO Instantiating Qwen2ForCausalLM model under default dtype torch.float32.
xinference-1 | 2025-02-19T03:24:17.263743772Z Instantiating Qwen2ForCausalLM model under default dtype torch.float32.
xinference-1 | 2025-02-19T03:24:17.265202469Z 2025-02-19 11:24:17,265 transformers.generation.configuration_utils 895 INFO Generate config GenerationConfig {
xinference-1 | 2025-02-19T03:24:17.265220364Z "bos_token_id": 151643,
xinference-1 | 2025-02-19T03:24:17.265224455Z "eos_token_id": 151643
xinference-1 | 2025-02-19T03:24:17.265227562Z }
xinference-1 | 2025-02-19T03:24:17.265230429Z
xinference-1 | 2025-02-19T03:24:17.265257203Z Generate config GenerationConfig {
xinference-1 | 2025-02-19T03:24:17.265264311Z "bos_token_id": 151643,
xinference-1 | 2025-02-19T03:24:17.265267612Z "eos_token_id": 151643
xinference-1 | 2025-02-19T03:24:17.265270579Z }
xinference-1 | 2025-02-19T03:24:17.265273641Z
Loading checkpoint shards: 100%|█████████████████| 2/2 [00:02<00:00, 1.27s/it]
xinference-1 | 2025-02-19T03:24:19.856022983Z 2025-02-19 11:24:19,855 transformers.modeling_utils 895 INFO All model checkpoint weights were used when initializing Qwen2ForCausalLM.
xinference-1 | 2025-02-19T03:24:19.856043588Z
xinference-1 | 2025-02-19T03:24:19.856116166Z All model checkpoint weights were used when initializing Qwen2ForCausalLM.
xinference-1 | 2025-02-19T03:24:19.856124339Z
xinference-1 | 2025-02-19T03:24:19.856188283Z 2025-02-19 11:24:19,856 transformers.modeling_utils 895 INFO All the weights of Qwen2ForCausalLM were initialized from the model checkpoint at /root/.xinference/cache/deepseek-r1-distill-qwen-pytorch-7b.
xinference-1 | 2025-02-19T03:24:19.856202856Z If your task is similar to the task the model of the checkpoint was trained on, you can already use Qwen2ForCausalLM for predictions without further training.
xinference-1 | 2025-02-19T03:24:19.856221471Z All the weights of Qwen2ForCausalLM were initialized from the model checkpoint at /root/.xinference/cache/deepseek-r1-distill-qwen-pytorch-7b.
xinference-1 | 2025-02-19T03:24:19.856232251Z If your task is similar to the task the model of the checkpoint was trained on, you can already use Qwen2ForCausalLM for predictions without further training.
xinference-1 | 2025-02-19T03:24:20.033863228Z 2025-02-19 11:24:20,033 transformers.generation.configuration_utils 895 INFO loading configuration file /root/.xinference/cache/deepseek-r1-distill-qwen-pytorch-7b/generation_config.json
xinference-1 | 2025-02-19T03:24:20.033914489Z loading configuration file /root/.xinference/cache/deepseek-r1-distill-qwen-pytorch-7b/generation_config.json
xinference-1 | 2025-02-19T03:24:20.034287472Z 2025-02-19 11:24:20,034 transformers.generation.configuration_utils 895 INFO Generate config GenerationConfig {
xinference-1 | 2025-02-19T03:24:20.034304324Z "bos_token_id": 151646,
xinference-1 | 2025-02-19T03:24:20.034307814Z "do_sample": true,
xinference-1 | 2025-02-19T03:24:20.034310716Z "eos_token_id": 151643,
xinference-1 | 2025-02-19T03:24:20.034313584Z "temperature": 0.6,
xinference-1 | 2025-02-19T03:24:20.034316389Z "top_p": 0.95
xinference-1 | 2025-02-19T03:24:20.034319111Z }
xinference-1 | 2025-02-19T03:24:20.034321741Z
xinference-1 | 2025-02-19T03:24:20.034336378Z Generate config GenerationConfig {
xinference-1 | 2025-02-19T03:24:20.034349354Z "bos_token_id": 151646,
xinference-1 | 2025-02-19T03:24:20.034352721Z "do_sample": true,
xinference-1 | 2025-02-19T03:24:20.034356622Z "eos_token_id": 151643,
xinference-1 | 2025-02-19T03:24:20.034359602Z "temperature": 0.6,
xinference-1 | 2025-02-19T03:24:20.034362617Z "top_p": 0.95
xinference-1 | 2025-02-19T03:24:20.034365585Z }
xinference-1 | 2025-02-19T03:24:20.034368537Z
xinference-1 | 2025-02-19T03:24:20.039889925Z 2025-02-19 11:24:20,039 xinference.core.model 895 INFO ModelActor(deepseek-r1-distill-qwen-0) loaded
xinference-1 | 2025-02-19T03:24:20.041223817Z 2025-02-19 11:24:20,041 xinference.core.worker 56 INFO [request fc1d9df2-ee70-11ef-9cb5-0242ac150002] Leave launch_builtin_model, elapsed time: 9 s
xinference-1 | 2025-02-19T03:24:29.607775623Z 2025-02-19 11:24:29,604 xinference.api.restful_api 1 ERROR Handling request http://localhost:9997/deepseek-r1-distill-qwen/run/predict failed: Unable to generate pydantic-core schema for <class 'starlette.requests.Request'>. Set arbitrary_types_allowed=True in the model_config to ignore this error or implement __get_pydantic_core_schema__ on your type to fully support it.
xinference-1 | 2025-02-19T03:24:29.607800455Z
xinference-1 | 2025-02-19T03:24:29.607804169Z If you got this error by calling handler() within __get_pydantic_core_schema__ then you likely need to call handler.generate_schema(<some type>) since we do not call __get_pydantic_core_schema__ on <some type> otherwise to avoid infinite recursion.
xinference-1 | 2025-02-19T03:24:29.607807979Z
xinference-1 | 2025-02-19T03:24:29.607810770Z For further information visit https://errors.pydantic.dev/2.10/u/schema-for-unknown-type
xinference-1 | 2025-02-19T03:24:29.607814056Z Traceback (most recent call last):
xinference-1 | 2025-02-19T03:24:29.607816938Z File "/opt/conda/lib/python3.11/site-packages/pydantic/type_adapter.py", line 271, in _init_core_attrs
xinference-1 | 2025-02-19T03:24:29.607820013Z self.core_schema = _getattr_no_parents(self._type, 'pydantic_core_schema')
xinference-1 | 2025-02-19T03:24:29.607822873Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.607825666Z File "/opt/conda/lib/python3.11/site-packages/pydantic/type_adapter.py", line 55, in _getattr_no_parents
xinference-1 | 2025-02-19T03:24:29.607828609Z raise AttributeError(attribute)
xinference-1 | 2025-02-19T03:24:29.607841597Z AttributeError: pydantic_core_schema
xinference-1 | 2025-02-19T03:24:29.607844739Z
xinference-1 | 2025-02-19T03:24:29.607847597Z During handling of the above exception, another exception occurred:
xinference-1 | 2025-02-19T03:24:29.607850479Z
xinference-1 | 2025-02-19T03:24:29.607853515Z Traceback (most recent call last):
xinference-1 | 2025-02-19T03:24:29.607856341Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/errors.py", line 165, in call
xinference-1 | 2025-02-19T03:24:29.607859453Z await self.app(scope, receive, _send)
xinference-1 | 2025-02-19T03:24:29.607862159Z File "/opt/conda/lib/python3.11/site-packages/aioprometheus/asgi/middleware.py", line 184, in call
xinference-1 | 2025-02-19T03:24:29.607865029Z await self.asgi_callable(scope, receive, wrapped_send)
xinference-1 | 2025-02-19T03:24:29.607867796Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/cors.py", line 93, in call
xinference-1 | 2025-02-19T03:24:29.607870849Z await self.simple_response(scope, receive, send, request_headers=headers)
xinference-1 | 2025-02-19T03:24:29.607873773Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/cors.py", line 144, in simple_response
xinference-1 | 2025-02-19T03:24:29.607876630Z await self.app(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607880393Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/exceptions.py", line 62, in call
xinference-1 | 2025-02-19T03:24:29.607883469Z await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607886225Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.607889100Z raise exc
xinference-1 | 2025-02-19T03:24:29.607892102Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.607895124Z await app(scope, receive, sender)
xinference-1 | 2025-02-19T03:24:29.607898066Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 715, in call
xinference-1 | 2025-02-19T03:24:29.607900891Z await self.middleware_stack(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607903593Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 735, in app
xinference-1 | 2025-02-19T03:24:29.607906376Z await route.handle(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607909350Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 460, in handle
xinference-1 | 2025-02-19T03:24:29.607912329Z await self.app(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607914967Z File "/opt/conda/lib/python3.11/site-packages/fastapi/applications.py", line 1054, in call
xinference-1 | 2025-02-19T03:24:29.607917864Z await super().call(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607920645Z File "/opt/conda/lib/python3.11/site-packages/starlette/applications.py", line 112, in call
xinference-1 | 2025-02-19T03:24:29.607925734Z await self.middleware_stack(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607928501Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/errors.py", line 187, in call
xinference-1 | 2025-02-19T03:24:29.607934797Z raise exc
xinference-1 | 2025-02-19T03:24:29.607937587Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/errors.py", line 165, in call
xinference-1 | 2025-02-19T03:24:29.607940626Z await self.app(scope, receive, _send)
xinference-1 | 2025-02-19T03:24:29.607943342Z File "/opt/conda/lib/python3.11/site-packages/gradio/route_utils.py", line 695, in call
xinference-1 | 2025-02-19T03:24:29.607949537Z await self.simple_response(scope, receive, send, request_headers=headers)
xinference-1 | 2025-02-19T03:24:29.607952599Z File "/opt/conda/lib/python3.11/site-packages/gradio/route_utils.py", line 711, in simple_response
xinference-1 | 2025-02-19T03:24:29.607955459Z await self.app(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607958182Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/exceptions.py", line 62, in call
xinference-1 | 2025-02-19T03:24:29.607961077Z await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607963998Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.607966842Z raise exc
xinference-1 | 2025-02-19T03:24:29.607969897Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.607973155Z await app(scope, receive, sender)
xinference-1 | 2025-02-19T03:24:29.607975874Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 715, in call
xinference-1 | 2025-02-19T03:24:29.607978818Z await self.middleware_stack(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607981594Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 735, in app
xinference-1 | 2025-02-19T03:24:29.607984647Z await route.handle(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607987735Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 288, in handle
xinference-1 | 2025-02-19T03:24:29.607990747Z await self.app(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607993884Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 76, in app
xinference-1 | 2025-02-19T03:24:29.607996667Z await wrap_app_handling_exceptions(app, request)(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.607999455Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.608002323Z raise exc
xinference-1 | 2025-02-19T03:24:29.608005029Z File "/opt/conda/lib/python3.11/site-packages/starlette/exception_handler.py", line 42, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.608008258Z await app(scope, receive, sender)
xinference-1 | 2025-02-19T03:24:29.608011243Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 73, in app
xinference-1 | 2025-02-19T03:24:29.608014047Z response = await f(request)
xinference-1 | 2025-02-19T03:24:29.608016879Z ^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608019759Z File "/opt/conda/lib/python3.11/site-packages/fastapi/routing.py", line 291, in app
xinference-1 | 2025-02-19T03:24:29.608025342Z solved_result = await solve_dependencies(
xinference-1 | 2025-02-19T03:24:29.608028227Z ^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608030922Z File "/opt/conda/lib/python3.11/site-packages/fastapi/dependencies/utils.py", line 666, in solve_dependencies
xinference-1 | 2025-02-19T03:24:29.608033754Z ) = await request_body_to_args( # body_params checked above
xinference-1 | 2025-02-19T03:24:29.608036575Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608039298Z File "/opt/conda/lib/python3.11/site-packages/fastapi/dependencies/utils.py", line 891, in request_body_to_args
xinference-1 | 2025-02-19T03:24:29.608042707Z fields_to_extract = get_cached_model_fields(first_field.type
)
xinference-1 | 2025-02-19T03:24:29.608045546Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608048287Z File "/opt/conda/lib/python3.11/site-packages/fastapi/_compat.py", line 659, in get_cached_model_fields
xinference-1 | 2025-02-19T03:24:29.608051139Z return get_model_fields(model)
xinference-1 | 2025-02-19T03:24:29.608053938Z ^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608056690Z File "/opt/conda/lib/python3.11/site-packages/fastapi/_compat.py", line 285, in get_model_fields
xinference-1 | 2025-02-19T03:24:29.608059802Z return [
xinference-1 | 2025-02-19T03:24:29.608062584Z ^
xinference-1 | 2025-02-19T03:24:29.608065646Z File "/opt/conda/lib/python3.11/site-packages/fastapi/_compat.py", line 286, in
xinference-1 | 2025-02-19T03:24:29.608068711Z ModelField(field_info=field_info, name=name)
xinference-1 | 2025-02-19T03:24:29.608071660Z File "", line 6, in init
xinference-1 | 2025-02-19T03:24:29.608074578Z File "/opt/conda/lib/python3.11/site-packages/fastapi/_compat.py", line 111, in post_init
xinference-1 | 2025-02-19T03:24:29.608077425Z self._type_adapter: TypeAdapter[Any] = TypeAdapter(
xinference-1 | 2025-02-19T03:24:29.608080276Z ^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608083359Z File "/opt/conda/lib/python3.11/site-packages/pydantic/type_adapter.py", line 228, in init
xinference-1 | 2025-02-19T03:24:29.608086340Z self._init_core_attrs(
xinference-1 | 2025-02-19T03:24:29.608089067Z File "/opt/conda/lib/python3.11/site-packages/pydantic/type_adapter.py", line 290, in _init_core_attrs
xinference-1 | 2025-02-19T03:24:29.608091899Z core_schema = schema_generator.generate_schema(self._type)
xinference-1 | 2025-02-19T03:24:29.608094674Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608097387Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 610, in generate_schema
xinference-1 | 2025-02-19T03:24:29.608100486Z schema = self._generate_schema_inner(obj)
xinference-1 | 2025-02-19T03:24:29.608103141Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608106029Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 863, in _generate_schema_inner
xinference-1 | 2025-02-19T03:24:29.608109450Z return self._annotated_schema(obj)
xinference-1 | 2025-02-19T03:24:29.608115138Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608117870Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 1977, in _annotated_schema
xinference-1 | 2025-02-19T03:24:29.608120820Z schema = self._apply_annotations(source_type, annotations)
xinference-1 | 2025-02-19T03:24:29.608123532Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608126259Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 2056, in _apply_annotations
xinference-1 | 2025-02-19T03:24:29.608129069Z schema = get_inner_schema(source_type)
xinference-1 | 2025-02-19T03:24:29.608131804Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608134501Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_schema_generation_shared.py", line 84, in call
xinference-1 | 2025-02-19T03:24:29.608137442Z schema = self._handler(source_type)
xinference-1 | 2025-02-19T03:24:29.608140096Z ^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608142774Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 2131, in new_handler
xinference-1 | 2025-02-19T03:24:29.608145669Z schema = metadata_get_schema(source, get_inner_schema)
xinference-1 | 2025-02-19T03:24:29.608148510Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608151656Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 2127, in
xinference-1 | 2025-02-19T03:24:29.608154888Z lambda source, handler: handler(source)
xinference-1 | 2025-02-19T03:24:29.608157578Z ^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608160400Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_schema_generation_shared.py", line 84, in call
xinference-1 | 2025-02-19T03:24:29.608163574Z schema = self._handler(source_type)
xinference-1 | 2025-02-19T03:24:29.608166646Z ^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608169374Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 2037, in inner_handler
xinference-1 | 2025-02-19T03:24:29.608172593Z schema = self._generate_schema_inner(obj)
xinference-1 | 2025-02-19T03:24:29.608175454Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608178170Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 884, in _generate_schema_inner
xinference-1 | 2025-02-19T03:24:29.608181138Z return self.match_type(obj)
xinference-1 | 2025-02-19T03:24:29.608184116Z ^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608186842Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 986, in match_type
xinference-1 | 2025-02-19T03:24:29.608189631Z return self._match_generic_type(obj, origin)
xinference-1 | 2025-02-19T03:24:29.608192607Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608195597Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 1014, in _match_generic_type
xinference-1 | 2025-02-19T03:24:29.608201579Z return self._union_schema(obj)
xinference-1 | 2025-02-19T03:24:29.608204276Z ^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608207394Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 1325, in _union_schema
xinference-1 | 2025-02-19T03:24:29.608210383Z choices.append(self.generate_schema(arg))
xinference-1 | 2025-02-19T03:24:29.608213035Z ^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608215778Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 610, in generate_schema
xinference-1 | 2025-02-19T03:24:29.608218618Z schema = self._generate_schema_inner(obj)
xinference-1 | 2025-02-19T03:24:29.608221360Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608224342Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 884, in _generate_schema_inner
xinference-1 | 2025-02-19T03:24:29.608227557Z return self.match_type(obj)
xinference-1 | 2025-02-19T03:24:29.608230235Z ^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608233013Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 995, in match_type
xinference-1 | 2025-02-19T03:24:29.608236274Z return self._unknown_type_schema(obj)
xinference-1 | 2025-02-19T03:24:29.608239340Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.608242419Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 513, in _unknown_type_schema
xinference-1 | 2025-02-19T03:24:29.608245302Z raise PydanticSchemaGenerationError(
xinference-1 | 2025-02-19T03:24:29.608247996Z pydantic.errors.PydanticSchemaGenerationError: Unable to generate pydantic-core schema for <class 'starlette.requests.Request'>. Set arbitrary_types_allowed=True in the model_config to ignore this error or implement __get_pydantic_core_schema__ on your type to fully support it.
xinference-1 | 2025-02-19T03:24:29.608251490Z
xinference-1 | 2025-02-19T03:24:29.608254314Z If you got this error by calling handler() within __get_pydantic_core_schema__ then you likely need to call handler.generate_schema(<some type>) since we do not call __get_pydantic_core_schema__ on <some type> otherwise to avoid infinite recursion.
xinference-1 | 2025-02-19T03:24:29.608257829Z
xinference-1 | 2025-02-19T03:24:29.608260446Z For further information visit https://errors.pydantic.dev/2.10/u/schema-for-unknown-type
xinference-1 | 2025-02-19T03:24:29.681782473Z 2025-02-19 11:24:29,679 xinference.api.restful_api 1 ERROR Handling request http://localhost:9997/deepseek-r1-distill-qwen/run/predict failed: Unable to generate pydantic-core schema for <class 'starlette.requests.Request'>. Set arbitrary_types_allowed=True in the model_config to ignore this error or implement __get_pydantic_core_schema__ on your type to fully support it.
xinference-1 | 2025-02-19T03:24:29.681807797Z
xinference-1 | 2025-02-19T03:24:29.681811478Z If you got this error by calling handler() within __get_pydantic_core_schema__ then you likely need to call handler.generate_schema(<some type>) since we do not call __get_pydantic_core_schema__ on <some type> otherwise to avoid infinite recursion.
xinference-1 | 2025-02-19T03:24:29.681823239Z
xinference-1 | 2025-02-19T03:24:29.681826152Z For further information visit https://errors.pydantic.dev/2.10/u/schema-for-unknown-type
xinference-1 | 2025-02-19T03:24:29.681829136Z Traceback (most recent call last):
xinference-1 | 2025-02-19T03:24:29.681831896Z File "/opt/conda/lib/python3.11/site-packages/pydantic/type_adapter.py", line 271, in _init_core_attrs
xinference-1 | 2025-02-19T03:24:29.681834894Z self.core_schema = _getattr_no_parents(self._type, 'pydantic_core_schema')
xinference-1 | 2025-02-19T03:24:29.681837748Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.681840755Z File "/opt/conda/lib/python3.11/site-packages/pydantic/type_adapter.py", line 55, in _getattr_no_parents
xinference-1 | 2025-02-19T03:24:29.681843726Z raise AttributeError(attribute)
xinference-1 | 2025-02-19T03:24:29.681846454Z AttributeError: pydantic_core_schema
xinference-1 | 2025-02-19T03:24:29.681849266Z
xinference-1 | 2025-02-19T03:24:29.681853714Z During handling of the above exception, another exception occurred:
xinference-1 | 2025-02-19T03:24:29.681856762Z
xinference-1 | 2025-02-19T03:24:29.681859561Z Traceback (most recent call last):
xinference-1 | 2025-02-19T03:24:29.681862577Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/errors.py", line 165, in call
xinference-1 | 2025-02-19T03:24:29.681865750Z await self.app(scope, receive, _send)
xinference-1 | 2025-02-19T03:24:29.681868584Z File "/opt/conda/lib/python3.11/site-packages/aioprometheus/asgi/middleware.py", line 184, in call
xinference-1 | 2025-02-19T03:24:29.681871656Z await self.asgi_callable(scope, receive, wrapped_send)
xinference-1 | 2025-02-19T03:24:29.681874505Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/cors.py", line 93, in call
xinference-1 | 2025-02-19T03:24:29.681877533Z await self.simple_response(scope, receive, send, request_headers=headers)
xinference-1 | 2025-02-19T03:24:29.681880324Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/cors.py", line 144, in simple_response
xinference-1 | 2025-02-19T03:24:29.681883293Z await self.app(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681886885Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/exceptions.py", line 62, in call
xinference-1 | 2025-02-19T03:24:29.681890336Z await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681893445Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.681896362Z raise exc
xinference-1 | 2025-02-19T03:24:29.681899098Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.681902068Z await app(scope, receive, sender)
xinference-1 | 2025-02-19T03:24:29.681905280Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 715, in call
xinference-1 | 2025-02-19T03:24:29.681908614Z await self.middleware_stack(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681911409Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 735, in app
xinference-1 | 2025-02-19T03:24:29.681917489Z await route.handle(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681920266Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 460, in handle
xinference-1 | 2025-02-19T03:24:29.681923286Z await self.app(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681926056Z File "/opt/conda/lib/python3.11/site-packages/fastapi/applications.py", line 1054, in call
xinference-1 | 2025-02-19T03:24:29.681928958Z await super().call(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681931688Z File "/opt/conda/lib/python3.11/site-packages/starlette/applications.py", line 112, in call
xinference-1 | 2025-02-19T03:24:29.681934778Z await self.middleware_stack(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681937560Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/errors.py", line 187, in call
xinference-1 | 2025-02-19T03:24:29.681940435Z raise exc
xinference-1 | 2025-02-19T03:24:29.681943108Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/errors.py", line 165, in call
xinference-1 | 2025-02-19T03:24:29.681945942Z await self.app(scope, receive, _send)
xinference-1 | 2025-02-19T03:24:29.681948660Z File "/opt/conda/lib/python3.11/site-packages/gradio/route_utils.py", line 695, in call
xinference-1 | 2025-02-19T03:24:29.681951625Z await self.simple_response(scope, receive, send, request_headers=headers)
xinference-1 | 2025-02-19T03:24:29.681954861Z File "/opt/conda/lib/python3.11/site-packages/gradio/route_utils.py", line 711, in simple_response
xinference-1 | 2025-02-19T03:24:29.681957804Z await self.app(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681960591Z File "/opt/conda/lib/python3.11/site-packages/starlette/middleware/exceptions.py", line 62, in call
xinference-1 | 2025-02-19T03:24:29.681963519Z await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681966572Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.681969577Z raise exc
xinference-1 | 2025-02-19T03:24:29.681972516Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 42, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.681975509Z await app(scope, receive, sender)
xinference-1 | 2025-02-19T03:24:29.681978368Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 715, in call
xinference-1 | 2025-02-19T03:24:29.681981241Z await self.middleware_stack(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681983975Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 735, in app
xinference-1 | 2025-02-19T03:24:29.681987032Z await route.handle(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681990109Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 288, in handle
xinference-1 | 2025-02-19T03:24:29.681993023Z await self.app(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.681995745Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 76, in app
xinference-1 | 2025-02-19T03:24:29.682002974Z await wrap_app_handling_exceptions(app, request)(scope, receive, send)
xinference-1 | 2025-02-19T03:24:29.682005968Z File "/opt/conda/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.682008858Z raise exc
xinference-1 | 2025-02-19T03:24:29.682011524Z File "/opt/conda/lib/python3.11/site-packages/starlette/exception_handler.py", line 42, in wrapped_app
xinference-1 | 2025-02-19T03:24:29.682022450Z await app(scope, receive, sender)
xinference-1 | 2025-02-19T03:24:29.682025739Z File "/opt/conda/lib/python3.11/site-packages/starlette/routing.py", line 73, in app
xinference-1 | 2025-02-19T03:24:29.682028802Z response = await f(request)
xinference-1 | 2025-02-19T03:24:29.682031658Z ^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682034401Z File "/opt/conda/lib/python3.11/site-packages/fastapi/routing.py", line 291, in app
xinference-1 | 2025-02-19T03:24:29.682037993Z solved_result = await solve_dependencies(
xinference-1 | 2025-02-19T03:24:29.682042153Z ^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682046496Z File "/opt/conda/lib/python3.11/site-packages/fastapi/dependencies/utils.py", line 666, in solve_dependencies
xinference-1 | 2025-02-19T03:24:29.682050471Z ) = await request_body_to_args( # body_params checked above
xinference-1 | 2025-02-19T03:24:29.682054859Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682058985Z File "/opt/conda/lib/python3.11/site-packages/fastapi/dependencies/utils.py", line 891, in request_body_to_args
xinference-1 | 2025-02-19T03:24:29.682063587Z fields_to_extract = get_cached_model_fields(first_field.type
)
xinference-1 | 2025-02-19T03:24:29.682067717Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682072233Z File "/opt/conda/lib/python3.11/site-packages/fastapi/_compat.py", line 659, in get_cached_model_fields
xinference-1 | 2025-02-19T03:24:29.682076782Z return get_model_fields(model)
xinference-1 | 2025-02-19T03:24:29.682080951Z ^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682085403Z File "/opt/conda/lib/python3.11/site-packages/fastapi/_compat.py", line 285, in get_model_fields
xinference-1 | 2025-02-19T03:24:29.682089934Z return [
xinference-1 | 2025-02-19T03:24:29.682094090Z ^
xinference-1 | 2025-02-19T03:24:29.682098819Z File "/opt/conda/lib/python3.11/site-packages/fastapi/_compat.py", line 286, in
xinference-1 | 2025-02-19T03:24:29.682103794Z ModelField(field_info=field_info, name=name)
xinference-1 | 2025-02-19T03:24:29.682108439Z File "", line 6, in init
xinference-1 | 2025-02-19T03:24:29.682112967Z File "/opt/conda/lib/python3.11/site-packages/fastapi/_compat.py", line 111, in post_init
xinference-1 | 2025-02-19T03:24:29.682117035Z self._type_adapter: TypeAdapter[Any] = TypeAdapter(
xinference-1 | 2025-02-19T03:24:29.682121223Z ^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682125386Z File "/opt/conda/lib/python3.11/site-packages/pydantic/type_adapter.py", line 228, in init
xinference-1 | 2025-02-19T03:24:29.682134106Z self._init_core_attrs(
xinference-1 | 2025-02-19T03:24:29.682138197Z File "/opt/conda/lib/python3.11/site-packages/pydantic/type_adapter.py", line 290, in _init_core_attrs
xinference-1 | 2025-02-19T03:24:29.682142605Z core_schema = schema_generator.generate_schema(self._type)
xinference-1 | 2025-02-19T03:24:29.682146873Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682151223Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 610, in generate_schema
xinference-1 | 2025-02-19T03:24:29.682157122Z schema = self._generate_schema_inner(obj)
xinference-1 | 2025-02-19T03:24:29.682161770Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682166196Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 863, in _generate_schema_inner
xinference-1 | 2025-02-19T03:24:29.682170728Z return self._annotated_schema(obj)
xinference-1 | 2025-02-19T03:24:29.682174457Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682177144Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 1977, in _annotated_schema
xinference-1 | 2025-02-19T03:24:29.682180014Z schema = self._apply_annotations(source_type, annotations)
xinference-1 | 2025-02-19T03:24:29.682183009Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682185945Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 2056, in _apply_annotations
xinference-1 | 2025-02-19T03:24:29.682188782Z schema = get_inner_schema(source_type)
xinference-1 | 2025-02-19T03:24:29.682191460Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682194445Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_schema_generation_shared.py", line 84, in call
xinference-1 | 2025-02-19T03:24:29.682197698Z schema = self._handler(source_type)
xinference-1 | 2025-02-19T03:24:29.682200695Z ^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682203380Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 2131, in new_handler
xinference-1 | 2025-02-19T03:24:29.682206203Z schema = metadata_get_schema(source, get_inner_schema)
xinference-1 | 2025-02-19T03:24:29.682209933Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682213318Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 2127, in
xinference-1 | 2025-02-19T03:24:29.682216370Z lambda source, handler: handler(source)
xinference-1 | 2025-02-19T03:24:29.682219071Z ^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682222115Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_schema_generation_shared.py", line 84, in call
xinference-1 | 2025-02-19T03:24:29.682225078Z schema = self._handler(source_type)
xinference-1 | 2025-02-19T03:24:29.682229364Z ^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682233733Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 2037, in inner_handler
xinference-1 | 2025-02-19T03:24:29.682243063Z schema = self._generate_schema_inner(obj)
xinference-1 | 2025-02-19T03:24:29.682248318Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682252836Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 884, in _generate_schema_inner
xinference-1 | 2025-02-19T03:24:29.682256528Z return self.match_type(obj)
xinference-1 | 2025-02-19T03:24:29.682259564Z ^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682262220Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 986, in match_type
xinference-1 | 2025-02-19T03:24:29.682265528Z return self._match_generic_type(obj, origin)
xinference-1 | 2025-02-19T03:24:29.682268590Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682271461Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 1014, in _match_generic_type
xinference-1 | 2025-02-19T03:24:29.682274737Z return self._union_schema(obj)
xinference-1 | 2025-02-19T03:24:29.682277615Z ^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682280337Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 1325, in _union_schema
xinference-1 | 2025-02-19T03:24:29.682283495Z choices.append(self.generate_schema(arg))
xinference-1 | 2025-02-19T03:24:29.682286493Z ^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682289190Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 610, in generate_schema
xinference-1 | 2025-02-19T03:24:29.682294993Z schema = self._generate_schema_inner(obj)
xinference-1 | 2025-02-19T03:24:29.682297979Z ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xinference-1 | 2025-02-19T03:24:29.682300687Z File "/opt/conda/lib/python3.11/site-packages/pydantic/_internal/_generate_schema.py", line 884, in _generate_schema_inner
xinference-1 | 2025-02-19T03:24:29.682303822Z return self.match_type(obj)
xinference-1 | 2025-02-19T03:24:29.682306871Z ^^^^^^^^^^^^^^^^^^^^

Expected behavior / 期待表现

测试信息应正常返回结果

@XprobeBot XprobeBot added the gpu label Feb 19, 2025
@XprobeBot XprobeBot added this to the v1.x milestone Feb 19, 2025
@SharkSyl
Copy link
Author

gradio                         3.50.2
gradio_client                  0.6.1
pydantic                       1.10.21
pydantic_core                  0.42.0

我尝试将gradio降低到<4.0和pydantic<2.0版本,解决了这个问题
这个问题,可能是transformes作为engine需要pydantic<2.0引起的

pip install "gradio<4.0"
pip install "pydantic_core<2"

这个情况会在官方镜像得到优化吗?

@SharkSyl
Copy link
Author

提示:you need autoawq>0.6.2
不过我查了下 https://github.com/casper-hansen/AutoAWQ 的最新版本是0.2.8
请问呢这个是我理解错了吗?

 xinference launch --model_path /root/.cache/huggingface/hub/deepseek-r1-distill-qwen-1.5b-awq --model-engine transformers --model-name deepseek-r1-distill-qwen --size-in-billions 1_5 --model-format awq  --quantization Int4
(base) root@6124cea6235e:/# xinference launch --model_path /root/.cache/huggingface/hub/deepseek-r1-distill-qwen-1.5b-awq --model-engine transformers --model-name deepseek-r1-distill-qwen --size-in-billions 1_5 --model-format awq  --quantization Int4
Launch model name: deepseek-r1-distill-qwen with kwargs: {'model_path': '/root/.cache/huggingface/hub/deepseek-r1-distill-qwen-1.5b-awq'}
Traceback (most recent call last):
  File "/opt/conda/bin/xinference", line 8, in <module>
    sys.exit(cli())
             ^^^^^
  File "/opt/conda/lib/python3.11/site-packages/click/core.py", line 1161, in __call__
    return self.main(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/site-packages/click/core.py", line 1082, in main
    rv = self.invoke(ctx)
         ^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/site-packages/click/core.py", line 1697, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/site-packages/click/core.py", line 1443, in invoke
    return ctx.invoke(self.callback, **ctx.params)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/site-packages/click/core.py", line 788, in invoke
    return __callback(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/site-packages/click/decorators.py", line 33, in new_func
    return f(get_current_context(), *args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/site-packages/xinference/deploy/cmdline.py", line 908, in model_launch
    model_uid = client.launch_model(
                ^^^^^^^^^^^^^^^^^^^^
  File "/opt/conda/lib/python3.11/site-packages/xinference/client/restful/restful_client.py", line 999, in launch_model
    raise RuntimeError(
RuntimeError: Failed to launch model, detail: [address=0.0.0.0:39895, pid=1087] To use IPEX backend, you need autoawq>0.6.2. Please install the latest version or from source.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants