-
Notifications
You must be signed in to change notification settings - Fork 3.8k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[fix] Use MSC for checking checkpoint existence
community-request
#4251
opened Apr 10, 2026 by
pavelgein
Loading…
1 of 5 tasks
feat(moe): add DenseMixer dense forward pass for improved router grad…
community-request
#4250
opened Apr 10, 2026 by
meinie0826
Loading…
4 of 5 tasks
Enable dbias_dprob triton kernel in TE GroupedLinear
#4242
opened Apr 9, 2026 by
vasunvidia
Contributor
Loading…
5 tasks
RL: Offload optimizer via torch_memory_saver
complexity: medium
Final Review
PR is in the "final review" stage
Fix MoE checkpoint save skipping TP ranks when TP > EP * ETP (#4200)
community-request
#4234
opened Apr 9, 2026 by
aarushisingh04
Loading…
1 of 5 tasks
feat(optimizer): add FlashAdamW optimizer integration
community-request
Final Review
PR is in the "final review" stage
#4229
opened Apr 9, 2026 by
meinie0826
Loading…
4 of 5 tasks
build: bump DeepEP to 34152ae
Approved
All necessary approvals have been made
complexity: low
core_r0.17.0
Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.
Minor improvements for Dynamic-cp
#4226
opened Apr 9, 2026 by
xiaoyao0115
Contributor
Loading…
5 tasks
Clean up remaining HAVE_TE gating in GPT experimental specs and inference controller
community-request
#4223
opened Apr 9, 2026 by
returnL
Contributor
Loading…
1 of 5 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2026-04-07.