Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问能在megatron基础上使用吗 #9

Open
pangsg opened this issue Aug 11, 2022 · 4 comments
Open

请问能在megatron基础上使用吗 #9

pangsg opened this issue Aug 11, 2022 · 4 comments

Comments

@pangsg
Copy link

pangsg commented Aug 11, 2022

No description provided.

@GongZhengLi
Copy link
Collaborator

nv的megatron-lm训练框架我们没有适配,目前是适配了fairseq和transformers,如果是megatron-lm的训练框架,需要进行模型转换。

@pangsg
Copy link
Author

pangsg commented Aug 11, 2022

好的,谢谢,那能适配分布式训练吗,在分布式训练的基础上会有速度的提升吗

@GongZhengLi
Copy link
Collaborator

适配分布式训练指的是训练的时候使用EET吗? 这个不行,EET是一个推理引擎,不支持反向传播。

@pangsg
Copy link
Author

pangsg commented Aug 11, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants