Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Ascend][internlm] speed_rmsandrope #53

Closed
wants to merge 4 commits into from
Closed

Conversation

SHshenhao
Copy link

优化了rmsnorm的接入,主要针对fp16
调整了ROPE,适配自行处理的internlm优化:上移华为软件栈下cos和sin repeat处理

Copy link
Collaborator

@lljbash lljbash left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

对 nv 会产生什么影响?

@lljbash lljbash added the question Further information is requested label Mar 12, 2024
@POI-WX POI-WX closed this Apr 19, 2024
@POI-WX POI-WX deleted the sh/speedup_rmsandrope branch April 24, 2024 04:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants