Pull requests: NVIDIA/Megatron-LM

Pull requests list

[fix] Use MSC for checking checkpoint existence
Labels: community-request
#4251 opened Apr 10, 2026 by pavelgein
1 of 5 tasks
Fix RL to once again work with --skip-train
Labels: complexity: low
#4249 opened Apr 10, 2026 by tdene Contributor
5 tasks
Core 0.16
Overload factor logging (squashed)
#4248 opened Apr 10, 2026 by nanz-nv Contributor Draft
5 tasks
Paged Stashing
#4247 opened Apr 10, 2026 by nanz-nv Contributor Draft
5 tasks
[Dev] Add runnable Qwen3.5 FSDP+EP in examples
#4245 opened Apr 10, 2026 by BestJuly Contributor Draft
5 tasks
Estimate speed-of-light decode latency
Labels: complexity: medium
#4243 opened Apr 9, 2026 by santhnm2 Contributor
5 tasks
Core 0.16
Enable dbias_dprob triton kernel in TE GroupedLinear
#4242 opened Apr 9, 2026 by vasunvidia Contributor
5 tasks
RL: Offload optimizer via torch_memory_saver
Labels: complexity: medium; Final Review (PR is in the "final review" stage)
#4241 opened Apr 9, 2026 by tdene Contributor Draft
5 tasks
Core 0.16
optimize attention init
#4240 opened Apr 9, 2026 by wdykas Contributor Draft
5 tasks
docs: add data loading best practices for large-scale training
#4236 opened Apr 9, 2026 by sbhavani Contributor Draft
1 of 5 tasks
RL: Onload optimizer after logprobs computation
Labels: complexity: low
#4235 opened Apr 9, 2026 by tdene Contributor
5 tasks
Core 0.16
Graphed update_requests
#4233 opened Apr 9, 2026 by tdene Contributor Draft
5 tasks
feat(optimizer): add FlashAdamW optimizer integration
Labels: community-request; Final Review (PR is in the "final review" stage)
#4229 opened Apr 9, 2026 by meinie0826
4 of 5 tasks
build: bump DeepEP to 34152ae
Labels: Approved (All necessary approvals have been made); complexity: low; core_r0.17.0 (Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.)
#4228 opened Apr 9, 2026 by ko3n1g Contributor
1 task
Core 0.16
Minor improvements for Dynamic-cp
#4226 opened Apr 9, 2026 by xiaoyao0115 Contributor
5 tasks
Extract args init to launch scripts
Labels: complexity: low
#4225 opened Apr 9, 2026 by maanug-nv Contributor
5 tasks
Core 0.16