Skip to content
This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

fix int8 skip module config #1682

Merged
merged 7 commits into from
Aug 9, 2024
Merged

fix int8 skip module config #1682

merged 7 commits into from
Aug 9, 2024

Conversation

changwangss
Copy link
Contributor

Type of Change

feature or bug fix or documentation or others
API changed or not

Description

detail description
JIRA ticket: xxx

Expected Behavior & Potential Risk

the expected behavior that triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: changwangss <[email protected]>
Copy link

github-actions bot commented Aug 5, 2024

⚡ Required checks status: All passing 🟢

Groups summary

🟢 Format Scan Tests workflow
Check ID Status Error details
format-scan (pylint) success
format-scan (bandit) success
format-scan (cloc) success
format-scan (cpplint) success

These checks are required after the changes to intel_extension_for_transformers/transformers/utils/config.py.

🟢 Optimize Unit Test workflow
Check ID Status Error details
optimize-unit-test-baseline success
optimize-unit-test-PR-test success
Genreate-OptimizeUT-Report success

These checks are required after the changes to intel_extension_for_transformers/transformers/utils/config.py.

🟢 Engine Unit Test workflow
Check ID Status Error details
engine-unit-test-baseline success
engine-unit-test-PR-test success
Genreate-Engine-Report success

These checks are required after the changes to intel_extension_for_transformers/transformers/utils/config.py.


Thank you for your contribution! 💜

Note
This comment is automatically generated and will be updates every 180 seconds within the next 6 hours. If you have any other questions, contact VincyZhang or XuehaoSun for help.

@changwangss
Copy link
Contributor Author

offline validated chatglm2, dolly v2 3b.

Copy link
Contributor

@Kaihui-intel Kaihui-intel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

pls update other configs (rtn, gptq and others)

self.llm_int8_skip_modules = kwargs.get("llm_int8_skip_modules", ["lm_head", "output_layer", "embed_out"])

@changwangss
Copy link
Contributor Author

pls update other configs (rtn, gptq and others)

self.llm_int8_skip_modules = kwargs.get("llm_int8_skip_modules", ["lm_head", "output_layer", "embed_out"])

thanks review, improve.

@XuehaoSun XuehaoSun merged commit 6fadb18 into main Aug 9, 2024
17 checks passed
@XuehaoSun XuehaoSun deleted the wangchang/fix_config branch August 9, 2024 03:14
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants