Skip to content

Pull requests: NVIDIA/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix: don't enter branch if mtp_num_layers == 0 community-request
#2581 opened Dec 6, 2025 by rj42 Loading…
6 tasks
Ignore log level in functional test
#2579 opened Dec 5, 2025 by kwyss-nvidia Loading…
6 tasks
Synchronize total block count across pipeline parallel ranks
#2578 opened Dec 5, 2025 by santhnm2 Loading…
6 tasks
fix: ckpt loading failed because of padding metadata in dist optimizer Expert Review Apply this label to indicate that your PR is ready for expert review.
#2576 opened Dec 5, 2025 by yaoyu-33 Loading…
6 tasks
[Megatron-FSDP] Support both old and new DeviceMesh APIs. Expert Review Apply this label to indicate that your PR is ready for expert review.
#2575 opened Dec 5, 2025 by cspades Loading…
3 of 6 tasks
Core 0.16
[Dev] Improve MoE Logging
#2569 opened Dec 5, 2025 by yanring Draft
6 tasks
Core 0.16
Add offset method for slow tokenizer community-request
#2567 opened Dec 5, 2025 by cael-ling Loading…
6 tasks
feat: Api compat add decorator dev
#2545 opened Dec 4, 2025 by pablo-garay Loading…
6 tasks
Use autodoc2 and remove automodule
#2542 opened Dec 4, 2025 by Phlip79 Draft
6 tasks
feat: m4 leftover changes
#2506 opened Dec 4, 2025 by yaoyu-33 Loading…
6 tasks
Log softmax decomposition
#2497 opened Dec 4, 2025 by shanmugamr1992 Loading…
6 tasks
remove fp16 assert in moe_grouped_gemm & EP
#2495 opened Dec 4, 2025 by HaochenYuan Loading…
6 tasks
Core 0.16
ProTip! no:milestone will show everything without a milestone.