Pull requests: InternLM/InternEvo
feat(*): Add internlm3 config
enhancement
#403
by yingtongxiong
was merged Jan 22, 2025
1 task done
fix(hf): fix convert_inetrnevo2hf for internlm2 model
#401
by zigzagcai
was merged Jan 17, 2025
6 tasks
fix(hybrid optim): fp32_grad not scaled when use offload_cpu
#399
by fengsibo
was merged Jan 6, 2025
1 of 6 tasks
feat(parallel_context.py): remove useless gqa process group
#390
by huangting4201
was merged Dec 17, 2024
6 tasks
feat(loss)/add different operator types for cross_entropy
#386
by yingtongxiong
was merged Dec 17, 2024
1 task done
fix(mlp.py): swap mlp w1w2w3 init order to w1w3w2 and fix QA
#384
by li126com
was merged Dec 6, 2024
feat(comm/attn_offload.py): support selective ckpt and cpu offload
#383
by huangting4201
was merged Dec 31, 2024
5 of 6 tasks
feat(isp): support switch for launch ag and forward overlap per module
enhancement
#381
by huangting4201
was merged Dec 17, 2024
5 of 6 tasks
fix(pp): fix pp get tensor shape err and layernorm input dtype err
bug
#378
by huangting4201
was merged Dec 10, 2024
5 of 6 tasks
fix(linear.py): linear module uneven split is forbidden
#374
by huangting4201
was merged Nov 27, 2024
fix(gmm): change communicator.grad_hook to async
#371
by blankde
was merged Dec 10, 2024
6 tasks
feat(fp8): [Work In Progress] enable FP8 training
#369
by zigzagcai
was closed Jan 10, 2025
6 tasks