VLLM Sampled Tokens #311

Merged: 4 commits merged into 0.4 from vllm-sample on Jan 31, 2025

Conversation

AdamBelfki3 (Collaborator)

  • Fixed the narrowing of the logits module output so that it accurately reflects the batch_groups.
import nnsight

# vllm_gpt2 is an nnsight VLLM wrapper around GPT-2, constructed beforehand.
with vllm_gpt2.trace(temperature=0.0, top_p=1.0, max_tokens=3) as tracer:
    with tracer.invoke(
        [
            "Madison Square Garden is located in the city of", 
            "The Eiffel Tower is located in the city of",
        ]
    ):

        logits_1 = nnsight.list().save()

        for ii in range(3):
            logits_1.append(vllm_gpt2.logits.output)
            vllm_gpt2.logits.next()

    with tracer.invoke("Rome is the capital city of"):
        logits_2 = nnsight.list().save()

        for ii in range(5):
            logits_2.append(vllm_gpt2.logits.output)
            vllm_gpt2.logits.next()

# Each invoker now only sees the logits for its own prompts.
assert all(logit.shape[0] == 2 for logit in logits_1)
assert all(logit.shape[0] == 1 for logit in logits_2)

Prior to this fix, the logits output inside each invoker contained the logits for all the prompts passed in the entire trace, rather than only the prompts of that invoker.
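
To illustrate the idea behind the fix, here is a minimal, hedged sketch of batch-group narrowing (full_logits and batch_groups are hypothetical names for illustration, not nnsight internals): the combined logits tensor for the whole trace is sliced so each invoker only sees the rows belonging to its own prompts.

import torch

# The trace above runs 3 prompts in total (2 in the first invoker, 1 in the
# second), so each decoding step yields one combined logits tensor that must
# be split per invoker.
full_logits = torch.randn(3, 50257)       # one row per prompt in the trace
batch_groups = [(0, 2), (2, 1)]           # (start, size) for each invoker

per_invoker = [full_logits.narrow(0, start, size) for start, size in batch_groups]
assert per_invoker[0].shape[0] == 2       # first invoker: two prompts
assert per_invoker[1].shape[0] == 1       # second invoker: one prompt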

  • Added traceability for sampled tokens. vLLM provides functionality to configure how each sequence samples its next token. Here's an example of how you can trace that operation with the nnsight VLLM wrapper.
with vllm_gpt2.trace("Madison Square Garden is located in the city of", temperature=0.8, top_p=0.95, max_tokens=3) as tracer:
    samples = nnsight.list().save()
    for ii in range(3):
        samples.append(vllm_gpt2.samples.output)
        vllm_gpt2.samples.next()

print(samples)
>>> [tensor([16940]), tensor([319]), tensor([262])]
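
As a hedged follow-up (assuming the standard Hugging Face gpt2 tokenizer, which this snippet does not otherwise load, and that samples behaves as the plain list of single-element tensors printed above), the sampled ids can be decoded back into text:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
# samples holds one single-element tensor per generation step.
print(tokenizer.decode([t.item() for t in samples]))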

@JadenFiotto-Kaufman merged commit 5697d40 into 0.4 on Jan 31, 2025
1 check passed
@JadenFiotto-Kaufman deleted the vllm-sample branch on January 31, 2025 at 17:08