Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

服务器部署chatglm3模型后,用postman开两个窗口几乎同时调用/v1/chat/completions这个接口(流式输出),最后回答的文本时看不懂的文字,如下图,平时问答是几秒,现在响应时间持续几十秒,输出的都是看不懂的文字 #1327

Closed
levorge opened this issue Oct 16, 2024 · 1 comment
Assignees

Comments

@levorge
Copy link

levorge commented Oct 16, 2024

Feature request / 功能建议

image
image
后面输出的全是stx字样

Motivation / 动机

暂无

Your contribution / 您的贡献

暂无

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR self-assigned this Jan 6, 2025
@zRzRzRzRzRzRzR
Copy link
Member

这套方案是不支持并发的,并发就会出现那个问题。我们设计这个demo仅考虑了一个用户一次调用

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR closed this as not planned Won't fix, can't repro, duplicate, stale Jan 6, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants