How to use the output of DeBERTa pretraining #453

Open
yuzicx opened this issue Feb 18, 2024 · 0 comments

yuzicx commented Feb 18, 2024

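Hello, I ran an experiment with /examples/pretrain_erlangshen_deberta_v2/pretrain_deberta_base.sh.
After the run finished, I found two folders, ckpt and lightning_logs, in the corresponding workspace directory, but no .bin model file.
Under ckpt there are folders such as last.ckpt and model-epepoch=04-ststep=21950.ckpt, all with the same structure.
The checkpoint folder inside last.ckpt contains two files:
mp_rank_00_model_states.pt and zero_pp_rank_0_mp_rank_00_optim_states.pt
How should I load the trained model and use it for inference?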
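
The two files listed above look like a DeepSpeed checkpoint, so one possible route is to read the model-states file directly and re-save the weights in Hugging Face format. The following is a minimal sketch under several assumptions that are not confirmed by this issue: that the weights sit under a `module` key in `mp_rank_00_model_states.pt`, that a ZeRO stage below 3 was used (so the file holds complete weights), that the parameter names carry a wrapper prefix that must be removed, and that the model is a masked-LM model loadable via `AutoModelForMaskedLM`. The names `strip_prefix` and `export_to_hf` are hypothetical helpers introduced here for illustration.

```python
from collections import OrderedDict


def strip_prefix(state_dict, prefix):
    """Keep only the keys that start with `prefix`, with the prefix removed."""
    return OrderedDict(
        (key[len(prefix):], value)
        for key, value in state_dict.items()
        if key.startswith(prefix)
    )


def export_to_hf(model_states_path, config_dir, output_dir, prefix):
    """Re-save the weights of a DeepSpeed model-states file in Hugging Face format.

    `prefix` is the wrapper prefix on the parameter names; print the keys of
    the loaded state dict first to find out what it is for your run.
    """
    # Heavy imports stay local so strip_prefix can be used without them.
    import torch
    from transformers import AutoConfig, AutoModelForMaskedLM

    # Recent PyTorch releases may additionally need weights_only=False here;
    # only do that for checkpoints you trust.
    checkpoint = torch.load(model_states_path, map_location="cpu")

    # Assumption: the model weights are stored under the "module" key.
    weights = strip_prefix(checkpoint["module"], prefix)

    # Build an empty model from the same config that was used for pretraining.
    config = AutoConfig.from_pretrained(config_dir)
    model = AutoModelForMaskedLM.from_config(config)

    # strict=False so that mismatched names are reported instead of raising.
    missing, unexpected = model.load_state_dict(weights, strict=False)
    print("missing keys:", missing)
    print("unexpected keys:", unexpected)

    # Writes the config and weight files that from_pretrained() can read.
    model.save_pretrained(output_dir)
```

If both printed lists are empty, the output directory should be loadable with `from_pretrained` for inference. If the run used ZeRO stage 3, the weights are partitioned across files and this sketch does not apply; DeepSpeed's `zero_to_fp32.py` script, which DeepSpeed normally copies into the checkpoint folder, is the usual way to consolidate them first.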
