fix int8 skip module config #1682

changwangss · 2024-08-05T04:20:11Z

Type of Change

feature or bug fix or documentation or others
API changed or not

Description

detail description
JIRA ticket: xxx

Expected Behavior & Potential Risk

the expected behavior triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: changwangss <[email protected]>

github-actions · 2024-08-05T04:20:36Z

⚡ Required checks status: All passing 🟢

Groups summary

🟢 Format Scan Tests workflow

Check ID	Status
format-scan (pylint)	success	✅
format-scan (bandit)	success	✅
format-scan (cloc)	success	✅
format-scan (cpplint)	success	✅

These checks are required after the changes to intel_extension_for_transformers/transformers/utils/config.py.

🟢 Optimize Unit Test workflow

Check ID	Status
optimize-unit-test-baseline	success	✅
optimize-unit-test-PR-test	success	✅
Genreate-OptimizeUT-Report	success	✅

These checks are required after the changes to intel_extension_for_transformers/transformers/utils/config.py.

🟢 Engine Unit Test workflow

Check ID	Status
engine-unit-test-baseline	success	✅
engine-unit-test-PR-test	success	✅
Genreate-Engine-Report	success	✅

These checks are required after the changes to intel_extension_for_transformers/transformers/utils/config.py.

Thank you for your contribution! 💜

Note
This comment is automatically generated and will be updated every 180 seconds within the next 6 hours. If you have any other questions, contact VincyZhang or XuehaoSun for help.


changwangss · 2024-08-05T04:59:53Z

Validated offline with chatglm2 and dolly v2 3b.

Kaihui-intel

pls update other configs (rtn, gptq and others)

intel-extension-for-transformers/intel_extension_for_transformers/transformers/utils/config.py

Line 834 in b400cb9

self.llm_int8_skip_modules = kwargs.get("llm_int8_skip_modules", ["lm_head", "output_layer", "embed_out"])
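
For readers unfamiliar with the pattern in the quoted line, here is a minimal sketch of how a `kwargs.get` default for `llm_int8_skip_modules` might be consumed. The class `SketchQuantConfig`, the helper `should_skip`, and the example module names are hypothetical and are not taken from the library; only the default list comes from the quoted line.

```python
# Default list copied from the quoted config.py line.
DEFAULT_SKIP_MODULES = ["lm_head", "output_layer", "embed_out"]


class SketchQuantConfig:
    """Hypothetical stand-in for a quantization config (not the library's class)."""

    def __init__(self, **kwargs):
        # Use the caller's list when provided, otherwise fall back to the default.
        self.llm_int8_skip_modules = kwargs.get(
            "llm_int8_skip_modules", list(DEFAULT_SKIP_MODULES)
        )


def should_skip(module_name, skip_modules):
    """Assumed matching rule: skip if any dot-separated part of the name is listed."""
    return any(part in skip_modules for part in module_name.split("."))


config = SketchQuantConfig()
# Illustrative module names only; real models may name their layers differently.
names = [
    "transformer.h.0.attn.q_proj",
    "lm_head",
    "transformer.output_layer",
    "gpt_neox.embed_out",
]
to_quantize = [n for n in names if not should_skip(n, config.llm_int8_skip_modules)]
```

Under these assumptions, only the attention projection remains in `to_quantize`; the three output-head style names are left unquantized.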


changwangss · 2024-08-06T03:02:21Z

pls update other configs (rtn, gptq and others)

intel-extension-for-transformers/intel_extension_for_transformers/transformers/utils/config.py

Line 834 in b400cb9

self.llm_int8_skip_modules = kwargs.get("llm_int8_skip_modules", ["lm_head", "output_layer", "embed_out"])

Thanks for the review; updated.
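
One way a request like this could be handled is to keep a single shared default that every config class reads, so the RTN, GPTQ, and other configs cannot drift apart. This is a sketch under that assumption and not necessarily what the PR does; `_BaseSketchConfig`, `RtnSketchConfig`, and `GptqSketchConfig` are hypothetical names.

```python
# Single source of truth for the default; a tuple so it cannot be mutated by accident.
DEFAULT_SKIP_MODULES = ("lm_head", "output_layer", "embed_out")


class _BaseSketchConfig:
    """Hypothetical base class holding the shared default handling."""

    def __init__(self, **kwargs):
        # Copy into a fresh list so each instance owns its own value.
        self.llm_int8_skip_modules = list(
            kwargs.get("llm_int8_skip_modules", DEFAULT_SKIP_MODULES)
        )


class RtnSketchConfig(_BaseSketchConfig):
    """Hypothetical RTN-style config; inherits the default unchanged."""


class GptqSketchConfig(_BaseSketchConfig):
    """Hypothetical GPTQ-style config; inherits the default unchanged."""


rtn = RtnSketchConfig()
gptq = GptqSketchConfig()
custom = GptqSketchConfig(llm_int8_skip_modules=["lm_head"])
```

Copying the default into a new list per instance avoids the usual pitfall where several objects share, and accidentally mutate, one list.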

fix int8 skip module config

ef2a1d4

Signed-off-by: changwangss <[email protected]>

changwangss requested a review from PenghuiCheng as a code owner August 5, 2024 04:20

pre-commit-ci bot and others added 4 commits August 5, 2024 04:21

[pre-commit.ci] auto fixes from pre-commit.com hooks

0bc8428

for more information, see https://pre-commit.ci

fix embed_out

a08344a

Signed-off-by: changwangss <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

7914ccc

for more information, see https://pre-commit.ci

Update modeling_auto.py

1de8b17

Signed-off-by: Wang, Chang <[email protected]>

Kaihui-intel approved these changes Aug 5, 2024


changwangss and others added 2 commits August 6, 2024 11:00

Update config.py

827c95d

Signed-off-by: Wang, Chang <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

439511c

for more information, see https://pre-commit.ci

XuehaoSun merged commit 6fadb18 into main Aug 9, 2024
17 checks passed

XuehaoSun deleted the wangchang/fix_config branch August 9, 2024 03:14
