Integrate weight-only quantizaion of INC #417

mengniwang95 · 2023-08-27T06:03:52Z

This PR integrate weight-only quantization of neural compressor into optimum-intel.

Notice: Need to use the master branch for test

hshen14 · 2023-08-27T07:34:30Z

@echarlaix Could you please help review this PR? INC supports production-level quality of weight-only quantization including INT8 and INT4 for LLMs in latest master (also be released in INC v2.3 in early Sep). Thanks.

HuggingFaceDocBuilderDev · 2023-08-27T07:37:17Z

The documentation is not available anymore as the PR was closed or merged.

echarlaix

Thanks a lot for the addition @mengniwang95

examples/neural_compressor/language-modeling/run_clm.py

optimum/intel/neural_compressor/quantization.py

tests/neural_compressor/test_optimization.py

echarlaix

Looks great, thanks for the addition @mengniwang95 !

mengniwang95 · 2023-09-11T13:48:19Z

Hi Ella, currently UT fails since it doesn't use the latest master code. Do we need to wait neural-compressor 2.3 release for UT test and merge this PR after all test passing? @echarlaix

echarlaix · 2023-09-11T16:26:43Z

Hi Ella, currently UT fails since it doesn't use the latest master code. Do we need to wait neural-compressor 2.3 release for UT test and merge this PR after all test passing? @echarlaix

For when is the neural-compressor release planned ? This PR is compatible with the current INC latest version so I'm ok to with merging it now (the test can be added in an other PR with INC being installed from source)

mengniwang95 · 2023-09-14T06:33:46Z

Hi Ella, currently UT fails since it doesn't use the latest master code. Do we need to wait neural-compressor 2.3 release for UT test and merge this PR after all test passing? @echarlaix

For when is the neural-compressor release planned ? This PR is compatible with the current INC latest version so I'm ok to with merging it now (the test can be added in an other PR with INC being installed from source)

Hi Ella, neural-compressor release is planned on 9/15. I add INT4 UT in this branch, but it is not triggered due to neural-compressor < 2.3

hshen14 · 2023-09-14T08:10:15Z

@echarlaix it seems some tests failed, while they may not be related with the changes. Could you please help check, or is it okay to get this PR merged?

echarlaix · 2023-09-14T08:43:44Z

@echarlaix it seems some tests failed, while they may not be related with the changes. Could you please help check, or is it okay to get this PR merged?

Could you update your branch by rebasing from main ? This will fix all unrelated tests. The INC tests are failing, because the release is previewed for tomorrow I think we should install neural-compressor from source here to verify all the tests are passing and then we can merge

Signed-off-by: Mengni Wang <[email protected]>

echarlaix reviewed Aug 29, 2023

View reviewed changes

echarlaix reviewed Sep 11, 2023

View reviewed changes

tests/neural_compressor/test_optimization.py Outdated Show resolved Hide resolved

echarlaix approved these changes Sep 11, 2023

View reviewed changes

mengniwang95 added 10 commits September 15, 2023 07:56

Integrate weight-only quantizaion of INC

6cdaeee

Signed-off-by: Mengni Wang <[email protected]>

add ut

fcea184

Signed-off-by: Mengni Wang <[email protected]>

reformat files

aae58d8

Signed-off-by: Mengni Wang <[email protected]>

update files

c767ea3

Update quantization.py

364bc32

Update run_clm.py

a54d2e2

Update test_optimization.py

69f39b9

Update README.md

a4170bc

Update quantization.py

0bbe65e

Update test_optimization.py

1c088c5

mengniwang95 force-pushed the main branch from e93dc19 to 1c088c5 Compare September 15, 2023 10:34

echarlaix merged commit a7782ae into huggingface:main Sep 15, 2023
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate weight-only quantizaion of INC #417

Integrate weight-only quantizaion of INC #417

mengniwang95 commented Aug 27, 2023

hshen14 commented Aug 27, 2023

HuggingFaceDocBuilderDev commented Aug 27, 2023 •

edited

Loading

echarlaix left a comment

echarlaix left a comment

mengniwang95 commented Sep 11, 2023

echarlaix commented Sep 11, 2023

mengniwang95 commented Sep 14, 2023

hshen14 commented Sep 14, 2023

echarlaix commented Sep 14, 2023

Integrate weight-only quantizaion of INC #417

Integrate weight-only quantizaion of INC #417

Conversation

mengniwang95 commented Aug 27, 2023

hshen14 commented Aug 27, 2023

HuggingFaceDocBuilderDev commented Aug 27, 2023 • edited Loading

echarlaix left a comment

Choose a reason for hiding this comment

echarlaix left a comment

Choose a reason for hiding this comment

mengniwang95 commented Sep 11, 2023

echarlaix commented Sep 11, 2023

mengniwang95 commented Sep 14, 2023

hshen14 commented Sep 14, 2023

echarlaix commented Sep 14, 2023

HuggingFaceDocBuilderDev commented Aug 27, 2023 •

edited

Loading