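First of all, I think this open-source project is very valuable: it makes it possible to evaluate multiple open-source large models within a single framework. However, when testing the scripts under adaptation, I ran into the problem that the required .pth weights are missing. I tried converting the publicly released .safetensors weights to .pth, but after conversion the model structures do not match, so OpenLTM still cannot load the pretrained weights. Comparing the model that the project builds with the model in the pretrained checkpoint shows that the two structures differ. Taking timer-xl as an example: the pretrained checkpoint uses parameter names such as model.layers.0.self_attn.q_proj.weight, while the OpenLTM model uses names such as 'blocks.attn_layers.4.norm2.weight'. Finally, I sincerely hope you can provide the pretrained weights for the models supported by OpenLTM. Thank you.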
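For reference, the check described above can be sketched roughly as follows. This is only an illustration, not code from OpenLTM: the helper names (`compare_keys`, `key_prefixes`, `convert_safetensors_to_pth`) and any file paths are hypothetical, and the only library calls assumed are `safetensors.torch.load_file` and `torch.save`. It shows how the mismatch can be observed; it does not resolve it, since the two naming schemes differ.

```python
from collections import Counter


def key_prefixes(keys, depth=2):
    """Count parameter names grouped by their first `depth` dotted components."""
    return Counter(".".join(k.split(".")[:depth]) for k in keys)


def compare_keys(checkpoint_keys, model_keys):
    """Return (missing, unexpected) key lists, in the same sense that
    load_state_dict(strict=False) reports them."""
    ckpt, model = set(checkpoint_keys), set(model_keys)
    missing = sorted(model - ckpt)      # expected by the model, absent from the checkpoint
    unexpected = sorted(ckpt - model)   # present in the checkpoint, unknown to the model
    return missing, unexpected


def convert_safetensors_to_pth(src, dst):
    """Re-save a .safetensors state dict as .pth (hypothetical helper).

    Parameter names are copied unchanged, so this alone cannot fix a
    naming mismatch between checkpoint and model.
    """
    # Imported lazily so the key-comparison helpers work without these packages.
    import torch
    from safetensors.torch import load_file

    state_dict = load_file(src)
    torch.save(state_dict, dst)
    return list(state_dict.keys())


# The two parameter names quoted in this issue, used as a tiny example.
ckpt_keys = ["model.layers.0.self_attn.q_proj.weight"]
model_keys = ["blocks.attn_layers.4.norm2.weight"]

missing, unexpected = compare_keys(ckpt_keys, model_keys)
print("missing from checkpoint:", missing)
print("unexpected in checkpoint:", unexpected)
print("checkpoint prefixes:", dict(key_prefixes(ckpt_keys)))
```

With real files, the keys returned by `convert_safetensors_to_pth` would be compared against `model.state_dict().keys()` of the OpenLTM model. When no names overlap, as in the example above, the converted file cannot be loaded directly.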