InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 430
Star 4.7k

Code
Issues 300
Pull requests 27
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: InternLM/lmdeploy

[Benchmark] benchmarks on different cuda architecture with mo...

#815 opened Dec 11, 2023 by lvhan028

Open 9

A100算力加持！书生大模型实战营第3期全面升级，趣味闯关模式等你开启

#2021 opened Jul 15, 2024 by boshallen

Open

Labels 34 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

300 Open 1,207 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

[Docs] 添加自定义的多模态模型的问题请教

#2790 opened Nov 21, 2024 by llery

[Bug] Does PytorchEngine Visual Model Support Prefix Caching?

#2789 opened Nov 21, 2024 by OftenDream

3 tasks

[Bug] Llama-3.2-1B-Instruct and InternVL2-1B does not supported kvin4, is that expected?

#2786 opened Nov 21, 2024 by zhulinJulia24

3 tasks

[Bug] Response of converted Qwen2-57B-A14B-Instruct-GPTQ-Int4 returns garbled characters

#2785 opened Nov 21, 2024 by zhulinJulia24

3 tasks

[Bug] SystemExit: 1 asyncio.exceptions.TimeoutError

#2782 opened Nov 20, 2024 by LIUKAI0815

2 of 3 tasks

[Bug] Qwen2.5无法跑通tools call(官方案例代码) awaiting response

#2775 opened Nov 20, 2024 by turkeymz

3 tasks done

[Feature] qwen2 vl support the turbomind engine

#2774 opened Nov 20, 2024 by DexterGuo

[Feature] turbomind后端是否会支持guided_decoding

#2771 opened Nov 19, 2024 by shell-nlp

[Bug] The quantization process of Qwen/Qwen2-VL-7B-Instruct is getting killed without throwing error.

#2770 opened Nov 19, 2024 by vjaideep08

3 tasks done

[Bug] 昇腾910B通过lmdeploy镜像，使用qwen2-vl-7b模型，推理过程报错： call aclnnBatchMatMul failed

#2769 opened Nov 18, 2024 by fusmile0101

1 of 3 tasks

How can I specify the rope scaling type when starting the API server?

#2768 opened Nov 18, 2024 by snachx

[Feature] W4A8-FP8 support in AWQ quantization

#2766 opened Nov 18, 2024 by yongchaoding

[Bug] lmdeploy加载lora微调模型报错

#2762 opened Nov 17, 2024 by ltt-gddxz

3 tasks done

[Bug] The script "profile_generation.py" went haywire and crashed

#2760 opened Nov 15, 2024 by yuchiwang

3 tasks done

Why do video frames in lmdeploy need to be converted into base64 encoding?

#2759 opened Nov 15, 2024 by AmazDeng

[Bug] lora微调后一直重复输出

#2757 opened Nov 14, 2024 by lylala8

3 tasks done

[Feature] Support response_format for TurboMind

#2753 opened Nov 13, 2024 by h4n0

[Bug] 0.6.2 vs 0.4.2 qwen1.5b模型，0.6.2推理性能差距有慢3倍

#2752 opened Nov 13, 2024 by xliangwu

1 of 3 tasks

[Feature] 有昇腾平台的模型性能测试数据吗

#2746 opened Nov 13, 2024 by zainlau

[Bug] Cannot install torch-npu==2.3.1, torch==2.3.1 and torchvision==0.18.1 because these package versions have conflicting dependencies.

#2745 opened Nov 13, 2024 by jiabao-wang

3 tasks

[Bug] 似乎卡死的都是VLM模型，看着是个系统性问题？

#2743 opened Nov 13, 2024 by DefTruth

3 tasks

[Feature] Qwen2-VL支持video

#2735 opened Nov 11, 2024 by evi-Genius

[Feature] The cache-max-entry-count working off percentages makes it difficult to setup multiple servers

#2732 opened Nov 9, 2024 by mrakgr

[Bug] Accuracy of W8A8 is big different from that of the original model

#2730 opened Nov 9, 2024 by HelloCard

3 tasks done

[Bug] Deployment of Llama3.1-70b getting struck

#2724 opened Nov 7, 2024 by pulkitmehtaworkmetacube

3 tasks done

Previous 1 2 3 4 5 … 11 12 Next

Previous Next

ProTip! Exclude everything labeled bug with -label:bug.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly