Pull requests: NVIDIA/Megatron-LM

Pull requests list

M-LM - M-Bridge Data consolidation Run functional tests
#5409 opened Jun 20, 2026 by asolergi-nv Contributor Draft
1 of 6 tasks
fix: Harden Claude GitHub workflows complexity: medium
#5408 opened Jun 20, 2026 by chtruong814 Contributor
debug: Test Claude Review community-request Final Review PR is in the "final review" stage
#5407 opened Jun 20, 2026 by CharlieTruong
6 tasks
Fix merges_file kwarg name in HuggingFaceTokenizer community-request
#5406 opened Jun 20, 2026 by muyihao
3 of 6 tasks
Remove Modelopt loading fn
#5405 opened Jun 19, 2026 by AAnoosheh Contributor Draft
2 of 6 tasks
Consistent oncall schedule complexity: low
#5404 opened Jun 18, 2026 by Phlip79 Member
1 task done
Rename CP batch helpers to describe balancing granularity complexity: low Final Review PR is in the "final review" stage
#5403 opened Jun 18, 2026 by deepakn94 Contributor
1 task done
chore: nightly sync main into dev (18_06_2026) complexity: high Run functional tests Run MBridge tests Attach this for testing this PR against MBridge main
#5402 opened Jun 18, 2026 by svcnvidia-nemo-ci
Fix fast-cache-load rank synchronization guard community-request waiting-on-customer Waiting on the original author to respond
#5398 opened Jun 18, 2026 by sandyhouse
1 task
[main] moe(perf): Refactor GDN A2A helper flow complexity: medium
#5392 opened Jun 17, 2026 by yuzhongw-nvidia Contributor
1 of 6 tasks
Add experimental decoupled compact LayerWise DDP layout for Muon (main)
#5391 opened Jun 17, 2026 by Wohox Contributor Draft
3 of 6 tasks
Test stacked PRs
#5390 opened Jun 17, 2026 by wujingyue Contributor Draft
[dev] Add experimental decoupled compact LayerWise DDP layout for Muon complexity: medium
#5388 opened Jun 17, 2026 by Wohox Contributor
3 of 6 tasks
Add experimental Megatron-FSDP fully_shard implementation complexity: medium Final Review PR is in the "final review" stage MFSDPv2 Run tests
#5387 opened Jun 17, 2026 by wujingyue Contributor
Fix fused MLA down projection with tensor parallelism complexity: low Final Review PR is in the "final review" stage
#5383 opened Jun 16, 2026 by sraman-rgb Contributor
6 tasks
Add generic interface for SSM inference
#5382 opened Jun 16, 2026 by santhnm2 Contributor Draft
6 tasks