-
Notifications
You must be signed in to change notification settings - Fork 4.2k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[experimental] Add experimental/agent_compose placeholder with preview pointer
docs-only
documentation only (docs or docstrings)
#5639
opened Jul 3, 2026 by
ISEEKYAN
Contributor
Loading…
[dev] moe(perf): Support fused pre-GDR path for chunkwise CP.
#5638
opened Jul 3, 2026 by
yuzhongw-nvidia
Contributor
•
Draft
6 tasks
[main] moe(feat): support chunkwise context parallelism for GDN
#5637
opened Jul 3, 2026 by
yuzhongw-nvidia
Contributor
•
Draft
6 tasks
[Dev] Fix FSDP backward hooks for TE-fused experts
#5636
opened Jul 3, 2026 by
lhb8125
Contributor
Loading…
Fix cu_seqlens broadcast shape for mbs > 1 with TP > 1
complexity: low
Final Review
PR is in the "final review" stage
#5635
opened Jul 3, 2026 by
deepakn94
Contributor
Loading…
4 tasks
Inference: Add super_v3 and ultra_v3 reasoning parsers
#5634
opened Jul 2, 2026 by
sidsingh-nvidia
Contributor
•
Draft
6 tasks
Update base image to nvcr.io/nvidia/pytorch:26.06-py3
complexity: medium
Run functional tests
#5632
opened Jul 2, 2026 by
balasaajay
Contributor
Loading…
1 of 6 tasks
Scatter embeddings for sequence parallelism in standalone LM forwards
complexity: low
#5628
opened Jul 2, 2026 by
kevalmorabia97
Contributor
Loading…
chore: nightly sync main into dev (02_07_2026)
Run functional tests
Run MBridge tests
Attach this for testing this PR against MBridge main
#5627
opened Jul 2, 2026 by
svcnvidia-nemo-ci
•
Draft
fix(examples): handle missing SFT chat template
community-request
#5626
opened Jul 2, 2026 by
fallintoplace
Loading…
Ignore contributor DCO failures in Claude fix
Approved
All necessary approvals have been made
complexity: low
#5625
opened Jul 2, 2026 by
Phlip79
Member
Loading…
1 task done
DO NOT MERGE: NVLS=1 symmetric-memory CI diagnostic
complexity: medium
#5624
opened Jul 2, 2026 by
wujingyue
Contributor
Loading…
Short-circuit condition to avoid copying from GPU memory in
ChainedOptimizer
#5623
opened Jul 2, 2026 by
filaretov
Loading…
1 task done
chore: AUT-673 Update Docker image version to 26.06-py3
ci
Run functional tests
#5622
opened Jul 2, 2026 by
svcnemo-autobot
Collaborator
•
Draft
chore: AUT-672 Update Docker image version to 26.06-py3
ci
Run functional tests
#5621
opened Jul 2, 2026 by
svcnemo-autobot
Collaborator
•
Draft
[dev] partial cuda graph support for dynamic cp
#5618
opened Jul 2, 2026 by
HaochenYuan
Contributor
Loading…
6 tasks
Inference: Add profile endpoints to chat completions.
#5611
opened Jul 1, 2026 by
sidsingh-nvidia
Contributor
•
Draft
6 tasks
Callback system
complexity: medium
#5610
opened Jul 1, 2026 by
maanug-nv
Contributor
Loading…
6 tasks
return prefix cache hits data from the chat completions api
#5609
opened Jul 1, 2026 by
sidsingh-nvidia
Contributor
•
Draft
1 of 6 tasks
Triton kernels - avoid recompilation and autotuning in prod
complexity: low
Final Review
PR is in the "final review" stage
#5608
opened Jul 1, 2026 by
sidsingh-nvidia
Contributor
Loading…
1 of 6 tasks
Inference: Add load aware routing to prefix caching.
#5607
opened Jul 1, 2026 by
sidsingh-nvidia
Contributor
•
Draft
1 of 6 tasks
Guard nvrx __version__ + degrade async support gracefully for older nvidia-resiliency-ext
#5605
opened Jul 1, 2026 by
yeyu-nvidia
Contributor
•
Draft
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.