Add mark_step for encoder layers #650
Conversation
LGTM
Force-pushed from c2f7554 to 5efc637
```diff
@@ -135,7 +135,7 @@ def flatten(in_list):
     return list(itertools.chain(*in_list))


-def get_decoder_layer_suffix(model_type):
+def get_target_layer_suffix(model_type) -> list[str]:
```
Please change to something like "get_target_layer_suffix_list" to reflect the return type.
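For context, here is a minimal sketch of what the renamed helper could look like, consistent with the diff above. The contents of the lookup table are illustrative assumptions, not taken verbatim from this PR:

```python
def get_target_layer_suffix_list(model_type) -> list[str]:
    # Assumed mapping: models whose decoder layer class name does not
    # end in "DecoderLayer" need an explicit entry here.
    decoder_layer_table = {
        "gpt_bigcode": "BigCodeBlock",
    }
    # Encoder layers are now targeted as well, so mark_step can be
    # inserted between encoder layers too (e.g. llama3.2 vision / mllama).
    return [
        decoder_layer_table.get(model_type, "DecoderLayer"),
        "EncoderLayer",
    ]
```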
```python
def modify_model_layers(module: torch.nn.Module,
                        suffix: list[str],
                        n=1,
                        counter=None):
```
Please change to something like "suffix_list" to reflect the underlying type
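A hedged sketch of how such a helper could walk the module tree and register a hook on every matching layer. `htorch.core.mark_step()` is the Habana API that flushes the accumulated HPU graph; the traversal logic below is an illustration under that assumption, not the PR's verbatim implementation:

```python
import habana_frameworks.torch as htorch
import torch


def modify_model_layers(module: torch.nn.Module,
                        suffix_list: list[str],
                        n=1,
                        counter=None):
    """Register a forward hook that calls mark_step() after every n-th
    submodule whose class name ends with one of the given suffixes."""

    def forward_hook(module, args, output):
        # Split the accumulated HPU graph at this layer boundary.
        htorch.core.mark_step()
        return output

    if counter is None:
        counter = [0]
    for child_name, child_module in module.named_children():
        if any(
                child_module.__class__.__name__.endswith(suffix)
                for suffix in suffix_list):
            counter[0] += 1
            if counter[0] % n == 0:
                child_module.register_forward_hook(forward_hook)
        else:
            # Recurse into non-matching containers to find nested layers.
            modify_model_layers(child_module, suffix_list, n, counter)
```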
Hi @michalkuligowski, thanks for your review, but I need to create a new PR #669 to address your comments. Can you help close this one and take a look at the new one?
This is an updated version of #650. Coupled with [Use FusedSDPA for MllamaVisionSdpaAttention #620], these two issues that arise when running the llama3.2 vision model can be resolved:

- GC failure when batch size > 1 on Gaudi3.
- Increased device memory consumption with Torch 2.5 compared to Torch 2.4.

---------

Signed-off-by: yan ma <[email protected]>
Co-authored-by: yisonzhu <[email protected]>
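Putting the two pieces together, a hypothetical call site for an encoder-decoder model such as mllama; the `model` handle and the choice of `n` are placeholders:

```python
# Insert mark_step after every encoder and decoder layer of the model.
suffix_list = get_target_layer_suffix_list("mllama")
modify_model_layers(model, suffix_list, n=1)
```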