You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi! I'm running vllm:0.7.2. When I tried to generate JSON, it was producing a lot of new lines. With Qwen2.5-72b, more than 1,000 tokens were new lines, and for 7b, the response was:
{
"clarifying_question": "Do you mean that you need to quickly deploy or start something (like an app, service, etc.) for $10?",
"cost_per_serving": "$10",
"calories": "",
"type_dish_ids": ""
,
"type_meal_ids": ""
,
"product_ids": [
"quick_launch_service"
],
"exclude_product_ids": [
""],
"allergen_ids": [
""]
, "total_cooking_time": "",
"kitchen_ids": "",
"holiday_ids": ""
}
The JSON schema was generated using a Pydantic model:
Hi! I'm running
vllm:0.7.2
. When I tried to generate JSON, it was producing a lot of new lines. WithQwen2.5-72b
, more than 1,000 tokens were new lines, and for7b
, the response was:The JSON schema was generated using a Pydantic model:
The text was updated successfully, but these errors were encountered: