This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

migrate PR [LLM Runtime]Magicoder graph #41

Merged: 10 commits into main on Jan 16, 2024

Conversation

intellinjun (Contributor)

Type of Change

feature or bug fix or documentation or others
API changed or not

Description

detail description
Issues: xxx

Expected Behavior & Potential Risk

the expected behavior triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

@a32543254 (Contributor) left a comment:

LGTM

@@ -178,6 +181,11 @@ def loadHFTransformerJson(model: 'LazyModel', config_path: Path) -> 'Params':
    ffn_hidden_size = config["intermediate_size"]
    rms_norm_eps = config["rms_norm_eps"]
    rope_theta = config["rope_theta"] if "rope_theta" in config else 10000
    rope_scale = 1
    if config["rope_scaling"]:
Review comment (Contributor): please check whether "rope_scaling" is in config

Suggested change:
-    if config["rope_scaling"]:
+    if "rope_scaling" in config and config["rope_scaling"] is not None:

@@ -179,6 +180,8 @@ def loadHFTransformerJson(model: 'LazyModel', config_path: Path) -> 'Params':
    ffn_hidden_size = config["intermediate_size"]
    rms_norm_eps = config["rms_norm_eps"]
    rope_theta = config["rope_theta"] if "rope_theta" in config else 10000
    rope_scale = config["factor"] if "factor" in config else 1
Review comment (Contributor): Mistral should align with Llama.
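One detail worth flagging (an observation, not from the PR thread): in Hugging Face configs the scaling factor usually sits nested under "rope_scaling" rather than at the top level, so the top-level "factor" lookup in the hunk above would silently fall back to 1. A minimal sketch of the difference, with illustrative config values:

# Illustrative HF-style config; values are made up for the example.
config = {"rope_theta": 10000.0,
          "rope_scaling": {"type": "linear", "factor": 2.0}}

# Top-level lookup, as in the hunk above: "factor" is not a top-level
# key here, so this silently falls back to 1.
rope_scale = config["factor"] if "factor" in config else 1
assert rope_scale == 1

# Nested lookup under "rope_scaling", consistent with the guard
# suggested earlier in this review:
scaling = config.get("rope_scaling") or {}
rope_scale = scaling.get("factor", 1)
assert rope_scale == 2.0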

Review comment (Contributor): please help update convert_quantized_llama.py and convert_quantized_mistral.py as well.

Signed-off-by: intellinjun <[email protected]>
Signed-off-by: intellinjun <[email protected]>
Comment on lines 249 to +250

 quant_script="./build/bin/quant_llama"
-convert_script="${convert_script}/convert_magicoder.py"
+convert_script="${convert_script}/convert_llama.py"
Review comment (Contributor): why use llama?
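A plausible answer (my reading, not stated in the thread): Magicoder checkpoints are fine-tuned from Llama-architecture base models, so the graph, converter, and quant binary can be shared. A hypothetical sketch of the model-to-toolchain mapping the test script is effectively encoding (names are illustrative, not the actual script):

# Hypothetical mapping from model family to (converter, quantizer),
# illustrating why Magicoder can ride on the Llama toolchain.
TOOLCHAIN = {
    "llama":     ("convert_llama.py", "./build/bin/quant_llama"),
    "mistral":   ("convert_mistral.py", "./build/bin/quant_mistral"),
    # Magicoder reuses the Llama graph, hence the Llama scripts:
    "magicoder": ("convert_llama.py", "./build/bin/quant_llama"),
}

convert_script, quant_script = TOOLCHAIN["magicoder"]
print(convert_script, quant_script)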

@VincyZhang (Contributor) merged commit 749caca into main on Jan 16, 2024. 10 checks passed.
Labels: none yet
Projects: none yet
4 participants