
Extra new lines for json generation #212

Closed
Samoed opened this issue Feb 20, 2025 · 2 comments
Samoed commented Feb 20, 2025

Hi! I'm running vLLM 0.7.2. When I try to generate JSON, the output contains a lot of extra newlines. With Qwen2.5-72B, more than 1,000 tokens of the response were newlines, and with the 7B model the response was:

{
    "clarifying_question": "Do you mean that you need to quickly deploy or start something (like an app, service, etc.) for $10?",
    "cost_per_serving": "$10",
    "calories": "",
    "type_dish_ids": ""
    ,
    "type_meal_ids": ""
    ,
    "product_ids": [
        "quick_launch_service"
    ],
    "exclude_product_ids": [
        
    
    ""],
    "allergen_ids": [
        
    ""]
    
    , "total_cooking_time": "",

    "kitchen_ids": "",

    "holiday_ids": ""

}

The JSON schema was generated using a Pydantic model:

from datetime import datetime
from langchain_openai import ChatOpenAI
from pydantic import BaseModel


class ResponseSchema(BaseModel):
    clarifying_question: str
    cost_per_serving: str
    calories: str
    type_dish_ids: str
    type_meal_ids: str
    product_ids: list[str]
    exclude_product_ids: list[str]
    allergen_ids: list[str]
    total_cooking_time: str
    kitchen_ids: str
    holiday_ids: str


def main():
    model = ChatOpenAI(
        model="qwen2.5-7b",
        base_url="http://localhost:8000/v1/",
        openai_api_key="test",
        temperature=0,
        # vLLM-specific options (guided_json, repetition_penalty, ...) are
        # forwarded to the OpenAI-compatible server via extra_body.
        extra_body={
            "repetition_penalty": 1.3,
            "presence_penalty": -1.1,
            "frequency_penalty": 0,
            "max_tokens": 2_000,
            "guided_json": ResponseSchema.model_json_schema(),
        },
    )

    query = "I want a quick launch fast with $10."
    RECIPE_PROMPT = [{"role": "user", "content": query}]
    
    print(datetime.now())
    structured_output = model.invoke(RECIPE_PROMPT)
    print(structured_output.content)
    print(ResponseSchema.model_validate_json(structured_output.content).model_dump())


if __name__ == "__main__":
    main()
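
A minimal sketch of one way to check how many of the generated tokens are just whitespace/newlines (this assumes the public Qwen/Qwen2.5-7B-Instruct tokenizer from Hugging Face and is not part of the reproduction script):

# Sketch (assumption): count tokens in the raw response that decode to pure
# whitespace, using the public Qwen2.5 tokenizer from Hugging Face.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-7B-Instruct")

def count_whitespace_tokens(text: str) -> int:
    token_ids = tokenizer.encode(text, add_special_tokens=False)
    # A token counts as "whitespace" if it decodes to nothing but spaces/newlines.
    return sum(1 for tid in token_ids if tokenizer.decode([tid]).strip() == "")

# e.g. count_whitespace_tokens(structured_output.content) for the script above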
@russellb

This PR to vLLM should resolve this issue: vllm-project/vllm#12744
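
Until a release with that fix is available, one possible mitigation is to restrict the whitespace the grammar is allowed to emit. This is only a sketch: it assumes the server accepts vLLM's guided_decoding_backend and guided_whitespace_pattern request options, and the whitespace pattern may only take effect with the outlines backend.

# Sketch of a possible mitigation (assumes vLLM's guided_decoding_backend /
# guided_whitespace_pattern extra-body options are available on this server version).
# ResponseSchema is the Pydantic model from the reproduction script above.
extra_body = {
    "guided_json": ResponseSchema.model_json_schema(),
    # Select the outlines backend and limit inter-token whitespace to at most one space.
    "guided_decoding_backend": "outlines",
    "guided_whitespace_pattern": r"[ ]?",
    "max_tokens": 2_000,
}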

Samoed commented Feb 20, 2025

I see that was fixed in #123. I'll close this for now. Thanks for the quick response!
