Name		Name	Last commit message	Last commit date
parent directory ..
academic_paper_scripts		academic_paper_scripts
bert		bert
gpt3		gpt3
inference		inference
mamba		mamba
mixtral		mixtral
multimodal		multimodal
retro		retro
t5		t5
README.md		README.md
llama3-70b.sh		llama3-70b.sh
llama3-8b.sh		llama3-8b.sh
preprocess-data.sh		preprocess-data.sh
run-megatron.sh		run-megatron.sh
run_simple_mcore_train_loop.py		run_simple_mcore_train_loop.py

README.md

Megatron LLaMA3-8B与LLaMA3-70B模型测试流程

环境与数据集准备

DLC镜像版本：nvcr.io/nvidia/pytorch:24.06-py3

LLaMA3仓库：

git clone https://github.com/meta-llama/llama3.git

安装依赖：

pip install tiktoken flash-attn modelscope nltk -i https://mirrors.aliyun.com/pypi/simple/

cd llama3
pip install -e . -i https://mirrors.aliyun.com/pypi/simple/

预处理数据，在Megatron-LM目录下运行，处理好的数据将保存在data目录下。
```
bash examples/preprocess_data.sh
```

运行测试

参考examples/llama3-8b.sh与examples/llama3-70b.sh脚本，设置DLC最终的启动命令为其中某一行即可执行特定配置下的测试，例如：

bash examples/run-megatron.sh --random-init --mbs 2 --gbs 8 --attn-type flash --seq-len 8192 --tp 2 --gc --gc-cnt 19

上述命令中的参数含义如下：

--random-init：随机初始化
--mbs 2：Micro batch size为2
--gbs 8：Global batch size为8
--attn-type flash：使用FlashAttention，可选项为flash、fused和unfused，分别对应FlashAttention、FusedAttention和UnfusedAttention
--seq-len 8192：序列长度为8192
--tp 2：Tensor parallel degree为2
--gc：开启GC
--gc-cnt 19：GC层数为19

如需更多定制化配置，请根据需求修改脚本中的参数设置。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

README.md

Megatron LLaMA3-8B与LLaMA3-70B模型测试流程

环境与数据集准备

运行测试

Files

examples

Directory actions

More options

Directory actions

More options

Latest commit

History

examples

Folders and files

parent directory

README.md

Megatron LLaMA3-8B与LLaMA3-70B模型测试流程

环境与数据集准备

运行测试