Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

加载本地embedding模型文件出错 #2855

Open
1 of 3 tasks
Tiaonmmn opened this issue Feb 13, 2025 · 0 comments
Open
1 of 3 tasks

加载本地embedding模型文件出错 #2855

Tiaonmmn opened this issue Feb 13, 2025 · 0 comments
Labels
Milestone

Comments

@Tiaonmmn
Copy link

Tiaonmmn commented Feb 13, 2025

System Info / 系統信息

pip list

Package                           Version
--------------------------------- ------------
accelerate                        1.3.0
aiofiles                          23.2.1
aiohappyeyeballs                  2.4.6
aiohttp                           3.10.11
aioprometheus                     23.12.0
aiosignal                         1.3.2
annotated-types                   0.7.0
anyio                             4.8.0
async-timeout                     5.0.1
attrs                             25.1.0
audioread                         3.0.1
bcrypt                            4.2.1
certifi                           2025.1.31
cffi                              1.17.1
charset-normalizer                3.4.1
click                             8.1.8
cloudpickle                       3.1.1
cryptography                      44.0.0
datasets                          3.2.0
decorator                         5.1.1
dill                              0.3.8
diskcache                         5.6.3
distro                            1.9.0
ecdsa                             0.19.0
fastapi                           0.115.8
ffmpy                             0.5.0
filelock                          3.17.0
frozenlist                        1.5.0
fsspec                            2024.9.0
gguf                              0.9.1
gradio                            5.3.0
gradio_client                     1.4.2
h11                               0.14.0
httpcore                          1.0.7
httptools                         0.6.4
httpx                             0.28.1
huggingface-hub                   0.25.0
idna                              3.10
importlib_metadata                8.6.1
interegular                       0.3.3
Jinja2                            3.1.5
jiter                             0.8.2
joblib                            1.4.2
jsonschema                        4.23.0
jsonschema-specifications         2024.10.1
lark                              1.2.2
lazy_loader                       0.4
librosa                           0.10.2.post1
llvmlite                          0.44.0
lm-format-enforcer                0.10.6
markdown-it-py                    3.0.0
MarkupSafe                        2.1.5
mdurl                             0.1.2
modelscope                        1.22.3
mpmath                            1.3.0
msgpack                           1.1.0
msgspec                           0.18.6
multidict                         6.1.0
multiprocess                      0.70.16
nest-asyncio                      1.6.0
networkx                          3.4.2
nltk                              3.9.1
numba                             0.61.0
numpy                             1.26.4
nvidia-cublas-cu12                12.1.3.1
nvidia-cuda-cupti-cu12            12.1.105
nvidia-cuda-nvrtc-cu12            12.1.105
nvidia-cuda-runtime-cu12          12.1.105
nvidia-cudnn-cu12                 9.1.0.70
nvidia-cufft-cu12                 11.0.2.54
nvidia-curand-cu12                10.3.2.106
nvidia-cusolver-cu12              11.4.5.107
nvidia-cusparse-cu12              12.1.0.106
nvidia-ml-py                      12.570.86
nvidia-nccl-cu12                  2.20.5
nvidia-nvjitlink-cu12             12.4.127
nvidia-nvtx-cu12                  12.1.105
openai                            1.61.1
orjson                            3.10.15
outlines                          0.0.46
packaging                         24.2
pandas                            2.2.3
passlib                           1.7.4
peft                              0.14.0
pillow                            10.4.0
pip                               25.0
platformdirs                      4.3.6
pooch                             1.8.2
prometheus_client                 0.21.1
prometheus-fastapi-instrumentator 7.0.2
protobuf                          5.29.3
psutil                            6.1.1
py-cpuinfo                        9.0.0
pyairports                        2.1.1
pyarrow                           19.0.0
pyasn1                            0.6.1
pycountry                         24.6.1
pycparser                         2.22
pydantic                          2.10.6
pydantic_core                     2.27.2
pydub                             0.25.1
Pygments                          2.19.1
python-dateutil                   2.9.0.post0
python-dotenv                     1.0.1
python-jose                       3.3.0
python-multipart                  0.0.20
pytz                              2025.1
PyYAML                            6.0.2
pyzmq                             26.2.1
quantile-python                   1.1
ray                               2.42.1
referencing                       0.36.2
regex                             2024.11.6
requests                          2.32.3
rich                              13.9.4
rpds-py                           0.22.3
rsa                               4.9
ruff                              0.9.6
safetensors                       0.5.2
scikit-learn                      1.6.1
scipy                             1.15.1
semantic-version                  2.10.0
sentence-transformers             3.1.0
sentencepiece                     0.2.0
setproctitle                      1.3.4
setuptools                        75.8.0
shellingham                       1.5.4
six                               1.17.0
sniffio                           1.3.1
soundfile                         0.13.1
soxr                              0.5.0.post1
sse-starlette                     2.2.1
starlette                         0.45.3
sympy                             1.13.1
tabulate                          0.9.0
tblib                             3.0.0
threadpoolctl                     3.5.0
tiktoken                          0.8.0
timm                              1.0.14
tokenizers                        0.21.0
tomlkit                           0.12.0
torch                             2.4.0
torchvision                       0.19.0
tqdm                              4.67.1
transformers                      4.48.3
triton                            3.0.0
typer                             0.15.1
typing_extensions                 4.12.2
tzdata                            2025.1
urllib3                           2.3.0
uvicorn                           0.34.0
uvloop                            0.21.0
vllm                              0.5.5
vllm-flash-attn                   2.6.1
watchfiles                        1.0.4
websockets                        12.0
wheel                             0.45.1
xformers                          0.0.27.post2
xinference                        1.2.2
xoscar                            0.4.6
xxhash                            3.5.0
yarl                              1.13.1
zipp                              3.21.0

Running Xinference with Docker? / 是否使用 Docker 运行 Xinfernece?

  • docker / docker
  • pip install / 通过 pip install 安装
  • installation from source / 从源码安装

Version info / 版本信息

1.2.2

The command used to start Xinference / 用以启动 xinference 的命令

xinference launch --model_path /data/bge-m3 -n bge-m3 -t embedding

Reproduction / 复现过程

在运行了xinference-local的前提下,运行xinference launch --model_path /data/bge-m3/ -n bge-m3 -t embedding命令报错:

Launch model name: bge-m3 with kwargs: {'model_path': '/data/bge-m3/'}
Traceback (most recent call last):
  File "/data/anaconda3/envs/xinference/bin/xinference", line 8, in <module>
    sys.exit(cli())
             ^^^^^
  File "/data/anaconda3/envs/xinference/lib/python3.11/site-packages/click/core.py", line 1161, in __call__
    return self.main(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/data/anaconda3/envs/xinference/lib/python3.11/site-packages/click/core.py", line 1082, in main
    rv = self.invoke(ctx)
         ^^^^^^^^^^^^^^^^
  File "/data/anaconda3/envs/xinference/lib/python3.11/site-packages/click/core.py", line 1697, in invoke
    return _process_result(sub_ctx.command.invoke(sub_ctx))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/data/anaconda3/envs/xinference/lib/python3.11/site-packages/click/core.py", line 1443, in invoke
    return ctx.invoke(self.callback, **ctx.params)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/data/anaconda3/envs/xinference/lib/python3.11/site-packages/click/core.py", line 788, in invoke
    return __callback(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/data/anaconda3/envs/xinference/lib/python3.11/site-packages/click/decorators.py", line 33, in new_func
    return f(get_current_context(), *args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/data/anaconda3/envs/xinference/lib/python3.11/site-packages/xinference/deploy/cmdline.py", line 908, in model_launch
    model_uid = client.launch_model(
                ^^^^^^^^^^^^^^^^^^^^
  File "/data/anaconda3/envs/xinference/lib/python3.11/site-packages/xinference/client/restful/restful_client.py", line 999, in launch_model
    raise RuntimeError(
RuntimeError: Failed to launch model, detail: [address=0.0.0.0:41303, pid=1830864] Repo id must be in the form 'repo_name' or 'namespace/repo_name': '/data/bge-m3/'. Use `repo_type` argument if needed.

Expected behavior / 期待表现

embedding模型正常工作

@XprobeBot XprobeBot added the gpu label Feb 13, 2025
@XprobeBot XprobeBot added this to the v1.x milestone Feb 13, 2025
@Tiaonmmn Tiaonmmn changed the title 运行本地embedding模型出错 加载本地embedding模型文件出错 Feb 13, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants