-
Notifications
You must be signed in to change notification settings - Fork 211
Conversation
Signed-off-by: changwangss <[email protected]>
⚡ Required checks status: All passing 🟢Groups summary🟢 Format Scan Tests workflow
These checks are required after the changes to 🟢 Optimize Unit Test workflow
These checks are required after the changes to 🟢 Engine Unit Test workflow
These checks are required after the changes to Thank you for your contribution! 💜
|
for more information, see https://pre-commit.ci
Signed-off-by: changwangss <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Wang, Chang <[email protected]>
offline validated chatglm2, dolly v2 3b. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
pls update other configs (rtn, gptq and others)
intel-extension-for-transformers/intel_extension_for_transformers/transformers/utils/config.py
Line 834 in b400cb9
self.llm_int8_skip_modules = kwargs.get("llm_int8_skip_modules", ["lm_head", "output_layer", "embed_out"]) |
Signed-off-by: Wang, Chang <[email protected]>
for more information, see https://pre-commit.ci
thanks review, improve. |
Type of Change
feature or bug fix or documentation or others
API changed or not
Description
detail description
JIRA ticket: xxx
Expected Behavior & Potential Risk
the expected behavior that triggered by this PR
How has this PR been tested?
how to reproduce the test (including hardware information)
Dependency Change?
any library dependency introduced or removed