[NPU] Hot fix of load_low_bit (#12344)
plusbang authored Nov 6, 2024
1 parent 899a303 commit 69e3a56
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion python/llm/src/ipex_llm/transformers/npu_model.py
@@ -549,7 +549,7 @@ def load_low_bit(cls, pretrained_model_name_or_path: str, *model_args, **kwargs)
 from ipex_llm.transformers.npu_models.convert_mp import optimize_llm
 optimize_llm(
     llm,
-    max_output_len=max_context_len,
+    max_context_len=max_context_len,
     max_prompt_len=max_prompt_len,
     inter_pp=inter_pp,
     intra_pp=intra_pp,
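
The one-line change above renames a keyword argument at the call site. A minimal sketch of why such a mismatch fails at call time, assuming the callee accepts `max_context_len` and no longer accepts `max_output_len` (the diff does not show the signature of `optimize_llm`). The function `optimize_llm_stub`, its default values, and `call_with_old_keyword` are hypothetical stand-ins for illustration, not the ipex_llm implementation:

```python
def optimize_llm_stub(model, max_context_len=1024, max_prompt_len=512,
                      inter_pp=None, intra_pp=None):
    """Hypothetical stand-in for optimize_llm; signature and defaults are assumed."""
    return {
        "max_context_len": max_context_len,
        "max_prompt_len": max_prompt_len,
        "inter_pp": inter_pp,
        "intra_pp": intra_pp,
    }


def call_with_old_keyword():
    """Call the stub with the pre-fix keyword and return the error text, if any."""
    try:
        # Pre-fix call site: the keyword name does not match any parameter.
        optimize_llm_stub(object(), max_output_len=1024, max_prompt_len=512)
    except TypeError as exc:
        return str(exc)
    return None


if __name__ == "__main__":
    # The old keyword is rejected before the function body runs.
    print(call_with_old_keyword())
    # Post-fix call site: the keyword matches the parameter name.
    print(optimize_llm_stub(object(), max_context_len=1024, max_prompt_len=512))
```

Under this assumption, Python raises `TypeError` for the unknown keyword as soon as `load_low_bit` reaches the call, which would be consistent with the commit being labeled a hot fix.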
