Error when running inference on longer text #338
Comments
This may be caused by running out of GPU memory.
No, it happens randomly; I think something is actually wrong.
Output tail: "6 多样化的". It looks like a bug that occurs once generation has finished. I ran the original openai_api code; only its top-level outline is preserved here:

    import asyncio
    import chatglm_cpp

    logging.basicConfig(level=logging.INFO, format=r"%(asctime)s - %(module)s - %(levelname)s - %(message)s")

    class Settings(BaseSettings):
    class ToolCallFunction(BaseModel):
    class ToolCall(BaseModel):
    class ContentText(BaseModel):
    class ContentImageUrlData(BaseModel):
    class ContentImageUrl(BaseModel):
    class ChatMessage(BaseModel):
    class DeltaMessage(BaseModel):
    class ChatCompletionToolFunction(BaseModel):
    class ChatCompletionTool(BaseModel):
    class ChatCompletionRequest(BaseModel):
    class ChatCompletionResponseChoice(BaseModel):
    class ChatCompletionResponseStreamChoice(BaseModel):
    class ChatCompletionUsage(BaseModel):
    class ChatCompletionResponse(BaseModel):

    settings = Settings()

    def stream_chat(messages, body):
    async def stream_chat_event_publisher(history, body):

    @app.post("/v1/chat/completions")

    class ModelCard(BaseModel):
    class ModelList(BaseModel):

    @app.get("/v1/models")

    if __name__ == "__main__":
This is probably a bug. Could you provide a minimal example that reproduces it?
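One possible shape for such a reproduction, as a rough sketch rather than the reporter's actual script: loop over the input files, call the model several times per file (the failure is reported as random), and print the file name and prompt length before each call. Because a `GGML_ASSERT` failure aborts the whole process, a Python `try`/`except` cannot catch it, so the last printed line is what identifies the input that was in flight. `run_batch` and `chat_fn` are hypothetical names; `chat_fn` stands in for whatever call performs inference.

```python
from pathlib import Path
from typing import Callable, Iterable, List, Tuple


def run_batch(
    paths: Iterable[Path],
    chat_fn: Callable[[str], str],
    repeats: int = 3,
) -> List[Tuple[str, int, int]]:
    """Run chat_fn on each file's text `repeats` times.

    Returns one (file, attempt, prompt length in characters) tuple per
    completed call.
    """
    completed = []
    for path in paths:
        prompt = Path(path).read_text(encoding="utf-8")
        for attempt in range(repeats):
            # Log BEFORE the call and flush: if the native code aborts,
            # this is the last line that reaches the terminal.
            print(f"file={path} attempt={attempt} chars={len(prompt)}", flush=True)
            chat_fn(prompt)
            completed.append((str(path), attempt, len(prompt)))
    return completed


# Assumed wiring to the chatglm_cpp Python binding (not taken from this
# thread; the model path is a placeholder):
#
#   import chatglm_cpp
#   pipeline = chatglm_cpp.Pipeline("path/to/model.bin")
#   chat_fn = lambda text: pipeline.chat(
#       [chatglm_cpp.ChatMessage(role="user", content=text)]
#   ).content
#   run_batch(sorted(Path("inputs").glob("*.txt")), chat_fn)
```

Attaching the last printed line together with the assertion message would show whether the crash correlates with prompt length or is independent of it.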
Model: chatglm4-9b-1m
When files are read in a batch and run through inference, the following error occurs at random:
GGML_ASSERT: /home/lili/chatglm.cpp/third_party/ggml/src/ggml.c:3596: view_src == NULL || data_size == 0 || data_size + view_offs <= ggml_nbytes(view_src)
Aborted (core dumped)
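For readers unfamiliar with the message, the asserted condition reads as a bounds check on a tensor view: either there is no source tensor, or the view is empty, or the view's byte range ends inside the source. The sketch below only restates the condition text from the message in Python; it is not taken from the ggml source, `view_in_bounds` is a hypothetical name, and the sizes in the usage note are made up.

```python
from typing import Optional


def view_in_bounds(view_src_nbytes: Optional[int], data_size: int, view_offs: int) -> bool:
    """Restate the condition printed in the assertion message.

    view_src_nbytes=None stands for `view_src == NULL`; otherwise it plays
    the role of `ggml_nbytes(view_src)`.
    """
    return (
        view_src_nbytes is None
        or data_size == 0
        or data_size + view_offs <= view_src_nbytes
    )
```

With a 1024-byte source, a 256-byte view at offset 768 satisfies the condition, while the same view at offset 769 would run one byte past the end and trip the assertion.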