Skip to content

Commit

Permalink
Add multi step scheduling scenario to jenkins CI (#445)
Browse files Browse the repository at this point in the history
Add multi step scheduling scenario to jenkins CI
  • Loading branch information
afierka-intel authored Nov 7, 2024
1 parent 3c39626 commit 11f5da6
Show file tree
Hide file tree
Showing 4 changed files with 34 additions and 0 deletions.
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
# FIXME(kzawora): these scores were generated using vLLM on HPU, we need to confirm them on HF
# VLLM_SKIP_WARMUP=true bash run-lm-eval-gsm-cot-llama-vllm-baseline.sh -m "/mnt/weka/data/pytorch/llama3.1/Meta-Llama-3.1-8B-Instruct" -b 128 -l 1319 -f 8 -t 1
model_name: "/mnt/weka/data/pytorch/llama3.1/Meta-Llama-3.1-8B-Instruct"
tasks:
- name: "gsm8k_cot_llama"
metrics:
- name: "exact_match,strict-match"
value: 0.8317
- name: "exact_match,flexible-extract"
value: 0.8355
limit: null
num_fewshot: 8
dtype: "bfloat16"
fewshot_as_multiturn: true
apply_chat_template: true
num_scheduler_steps: 10
1 change: 1 addition & 0 deletions .jenkins/lm-eval-harness/configs/models-mss.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
Meta-Llama-3.1-8B-Instruct-mss.yaml
3 changes: 3 additions & 0 deletions .jenkins/lm-eval-harness/test_lm_eval_correctness.py
Original file line number Diff line number Diff line change
Expand Up @@ -54,6 +54,9 @@ def launch_lm_eval(eval_config):
model_args += ",quantization=inc," \
"kv_cache_dtype=fp8_inc," \
"weights_load_device=cpu"
if eval_config.get("num_scheduler_steps"):
model_args += \
f",num_scheduler_steps={eval_config.get('num_scheduler_steps')}"
kwargs = {}
if 'fewshot_as_multiturn' in eval_config:
kwargs['fewshot_as_multiturn'] = eval_config['fewshot_as_multiturn']
Expand Down
14 changes: 14 additions & 0 deletions .jenkins/test_config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -27,3 +27,17 @@ stages:
- name: gsm8k_small_g3_tp1_fp8
flavor: g3
command: cd .jenkins/lm-eval-harness && bash run-tests.sh -c configs/models-fp8.txt -t 1
- name: test_gsm8k_mss
steps:
- name: gsm8k_small_g3_tp1_mss
flavor: g3
command: cd .jenkins/lm-eval-harness && bash run-tests.sh -c configs/models-mss.txt -t 1
- name: gsm8k_small_g2_tp1_mss
flavor: g2
command: cd .jenkins/lm-eval-harness && bash run-tests.sh -c configs/models-mss.txt -t 1
- name: gsm8k_small_g3_tp2_mss
flavor: g3.s
command: cd .jenkins/lm-eval-harness && bash run-tests.sh -c configs/models-mss.txt -t 2
- name: gsm8k_small_g2_tp2_mss
flavor: g2.s
command: cd .jenkins/lm-eval-harness && bash run-tests.sh -c configs/models-mss.txt -t 2

0 comments on commit 11f5da6

Please sign in to comment.