Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

基于qwen2-1.5B的训练 #135

Open
riariam opened this issue Nov 15, 2024 · 1 comment
Open

基于qwen2-1.5B的训练 #135

riariam opened this issue Nov 15, 2024 · 1 comment

Comments

@riariam
Copy link

riariam commented Nov 15, 2024

请问在训练qwen2-1.5B时,需要做什么特殊处理吗?感谢🙏

@YingHuTsing
Copy link
Collaborator

YingHuTsing commented Nov 16, 2024

1、首先请将tinyllava/training_recipe/base.py 这几行注释掉。
WechatIMG206
2、其次请将tinyllava/model/load_model.py这部分做上图改动,模型路径请替换成你自己finetune后的路径。
WechatIMG208
3、启动脚本请参照qwen2-0.5B的脚本。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants