[Optimization] Text-generation support qwen #513

changwangss · 2023-10-20T05:59:51Z

Type of Change

Qwen/Qwen-7B, Qwen/Qwen-14B, Qwen/Qwen-7B-Chat, and Qwen/Qwen-14B-Chat pass.

optimum: huggingface/optimum#1470

optimum-intel: huggingface/optimum-intel#458

Description

detail description
JIRA ticket: xxx

Expected Behavior & Potential Risk

the expected behavior that is triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

VincyZhang · 2023-10-23T08:05:26Z

Unit Test failed with lines coverage decrease -0.064%
Unit Test failed with branches coverage decrease -0.158%

changwangss · 2023-10-23T09:16:48Z

> Unit Test failed with lines coverage decrease -0.064%
> Unit Test failed with branches coverage decrease -0.158%

Yes, this is expected. Qwen doesn't have a tiny model that could be used to add a unit test. Once Qwen is officially included in transformers, the code newly added to the function that generates the dummy past-kv can be deleted, and the coverage will improve.
The PR is ready; please merge. @VincyZhang
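
For context on the dummy past-kv mentioned in the comment above: when a decoder model is traced for export, placeholder key/value cache inputs with the right shapes are typically supplied, one pair per layer. The following is a minimal sketch of that idea, not the code added in this PR. It only computes shapes in plain Python; the function name, its parameters, the `seq_first` layout switch, and the example sizes are all hypothetical.

```python
from typing import List, Tuple

Shape = Tuple[int, int, int, int]


def dummy_past_kv_shapes(
    num_layers: int,
    num_heads: int,
    head_dim: int,
    batch_size: int = 1,
    past_len: int = 1,
    seq_first: bool = False,
) -> List[Tuple[Shape, Shape]]:
    """Return one (key_shape, value_shape) pair per decoder layer.

    seq_first=False -> (batch, heads, past_len, head_dim)
    seq_first=True  -> (batch, past_len, heads, head_dim)
    Which layout a given model expects is an assumption the caller must check.
    """
    if seq_first:
        shape = (batch_size, past_len, num_heads, head_dim)
    else:
        shape = (batch_size, num_heads, past_len, head_dim)
    # Key and value caches share the same shape in this sketch.
    return [(shape, shape) for _ in range(num_layers)]


# Illustrative sizes only; they are not taken from this PR.
shapes = dummy_past_kv_shapes(num_layers=32, num_heads=32, head_dim=128, seq_first=True)
print(len(shapes), shapes[0][0])
```

A model-specific branch in such a helper is the kind of code that is hard to cover without a small test checkpoint, which is consistent with the coverage drop discussed above.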

zhewang1-intc and others added 15 commits October 19, 2023 13:22

[CPP Graph] Opt qbits dequant (#465)

f04d0fd

use INC 2.3.1

4adacf1

Signed-off-by: Wenxin Zhang <[email protected]>

use INC 2.3.1 (#500)

d962f58

Signed-off-by: Wenxin Zhang <[email protected]>

[RUNTIME] Enabing streaming llm for Runtime (#501)

66238a5

* Support StreamingLLM on CPU Signed-off-by: zhenwei-intel <[email protected]>

Merge branch 'main' of https://github.com/intel/intel-extension-for-t…

ea112e7

…ransformers

Reduce the UT evaluation time (#498)

51485c6

Signed-off-by: changwangss <[email protected]> Signed-off-by: Wenxin Zhang <[email protected]> Signed-off-by: Wang, Chang <[email protected]> Co-authored-by: Wenxin Zhang <[email protected]>

Merge branch 'main' of https://github.com/intel/intel-extension-for-t…

ff4abb8

…ransformers

Minor fix (#507)

9bdc764

support qwen

6bd2b60

Signed-off-by: changwangss <[email protected]>

Fix ChatGLM2 model loading issue (#510)

ea720c2

* Fix ChatGLM2 model loading issue Signed-off-by: lvliang-intel <[email protected]>

Update README.md

02523e9

Signed-off-by: Haihao Shen <[email protected]>

Remove OneDNN env setint for BF16 inference (#509)

0cff05a

Signed-off-by: lvliang-intel <[email protected]> Co-authored-by: VincyZhang <[email protected]>

remove invalid code

1bee379

Signed-off-by: changwangss <[email protected]>

support Avx2 (#493)

ea69f9a

* support Memcpy2D * support gelu fusion --------- Co-authored-by: luoyu-intel <[email protected]>

add neuralchat ut for audio util (#466)

f7d0d97

changwangss requested a review from PenghuiCheng as a code owner October 20, 2023 05:59

xin3he and others added 10 commits October 20, 2023 16:18

reduce ut time consumption (#499)

b9155ef

Signed-off-by: Xin He <[email protected]>

update python api readme (#504)

5f4175a

Add docker setup session for neuralchat finetuning sample (#496)

a8873ea

* Update README.md to new added docker setup session Signed-off-by: Louie Tsai <[email protected]>

Update README.md

22fe7ad

Signed-off-by: Haihao Shen <[email protected]>

Update run_generation.py

53b1b61

Signed-off-by: Wang, Chang <[email protected]>

Update README.md

b38241d

Signed-off-by: Haihao Shen <[email protected]>

Update README.md

1d91245

Signed-off-by: Haihao Shen <[email protected]>

Update README.md

18d9c57

Signed-off-by: Haihao Shen <[email protected]>

Update README.md

f98d72a

Signed-off-by: Haihao Shen <[email protected]>

Update README.md

0f6aee6

Signed-off-by: Haihao Shen <[email protected]>

hshen14 approved these changes Oct 20, 2023

louie-tsai and others added 3 commits October 21, 2023 18:07

Update README.md for fast token issue (#515)

a8db98f

Signed-off-by: Louie Tsai <[email protected]>

Fix typo in README.md (#516)

52717e4

convertion -> conversion Signed-off-by: Ikko Eltociear Ashimine <[email protected]>

Update README.md

3cf68ee

Signed-off-by: Haihao Shen <[email protected]>

hshen14 and others added 12 commits October 21, 2023 18:50

Update README.md

7fed478

Signed-off-by: Haihao Shen <[email protected]>

Update README.md

dc81e4c

Signed-off-by: Haihao Shen <[email protected]>

improve Avx2 (#511)

dcfbcfd

Merge branch 'main' of https://github.com/intel/intel-extension-for-t…

a615905

…ransformers

Revert "update python api readme (#504)"

61993cc

This reverts commit 5f4175a.

Merge branch 'main' into wangchang/qwen

4144197

Update README.md

5b01e95

Signed-off-by: Haihao Shen <[email protected]>

Update README.md (#519)

bfb6a25

Signed-off-by: ayushrakesh <[email protected]>

docs: fix typos in question answering of pytorch (#520)

0e0a9eb

Signed-off-by: Surav Shrestha <[email protected]>

fixed typos (#522)

ec29f2f

Updated README.md (#517)

1357a02

Signed-off-by: Aditya Aryaman Das <[email protected]>

Merge branch 'main' into wangchang/qwen

b3e4b25

VincyZhang force-pushed the main branch from 1357a02 to f04d0fd Compare October 23, 2023 03:40

VincyZhang requested review from VincyZhang, lvliang-intel, zhenwei-intel and airMeng as code owners October 23, 2023 03:40

VincyZhang force-pushed the main branch from f04d0fd to 1ab6ce3 Compare October 23, 2023 03:47

VincyZhang requested a review from a32543254 as a code owner October 23, 2023 03:47

Merge branch 'main' into wangchang/qwen

572ecbf

Merge branch 'main' into wangchang/qwen

2e77b6b

VincyZhang approved these changes Oct 23, 2023

hshen14 changed the title ~~Text-generation support qwen~~ [WIP] Text-generation support qwen Oct 23, 2023

VincyZhang changed the title ~~[WIP] Text-generation support qwen~~ [Optimize] Text-generation support qwen Oct 23, 2023

VincyZhang changed the title ~~[Optimize] Text-generation support qwen~~ [Optimization] Text-generation support qwen Oct 23, 2023

VincyZhang merged commit 8f41d49 into main Oct 23, 2023
15 of 16 checks passed

VincyZhang deleted the wangchang/qwen branch October 23, 2023 14:50

VincyZhang pushed a commit that referenced this pull request Oct 26, 2023

[Optimization] Text-generation support qwen (#513)

f78d114
