Uh oh!

There was an error while loading. Please reload this page.

NVIDIA / Megatron-LM Public

Notifications You must be signed in to change notification settings
Fork 4.2k
Star 16.9k

Code
Issues 370
Pull requests 589
Discussions
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security and quality
Insights

Pull requests: NVIDIA/Megatron-LM

Labels 66 Milestones 2

New pull request New

589 Open 3,318 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[codex] Separate mFSDP v2 unit tests Run tests

#5640 opened Jul 3, 2026 by wujingyue Contributor • Draft

[experimental] Add experimental/agent_compose placeholder with preview pointer docs-only

documentation only (docs or docstrings)

#5639 opened Jul 3, 2026 by ISEEKYAN Contributor

Loading…

[dev] moe(perf): Support fused pre-GDR path for chunkwise CP.

#5638 opened Jul 3, 2026 by yuzhongw-nvidia Contributor • Draft

6 tasks

[main] moe(feat): support chunkwise context parallelism for GDN

#5637 opened Jul 3, 2026 by yuzhongw-nvidia Contributor • Draft

6 tasks

[Dev] Fix FSDP backward hooks for TE-fused experts

#5636 opened Jul 3, 2026 by lhb8125 Contributor

Loading…

Fix cu_seqlens broadcast shape for mbs > 1 with TP > 1 complexity: low Final Review

PR is in the "final review" stage

#5635 opened Jul 3, 2026 by deepakn94 Contributor

Loading…

4 tasks

Inference: Add super_v3 and ultra_v3 reasoning parsers

#5634 opened Jul 2, 2026 by sidsingh-nvidia Contributor • Draft

6 tasks

Update base image to nvcr.io/nvidia/pytorch:26.06-py3 complexity: medium Run functional tests

#5632 opened Jul 2, 2026 by balasaajay Contributor

Loading…

1 of 6 tasks

Scatter embeddings for sequence parallelism in standalone LM forwards complexity: low

#5628 opened Jul 2, 2026 by kevalmorabia97 Contributor

Loading…

chore: nightly sync main into dev (02_07_2026) Run functional tests Run MBridge tests

Attach this for testing this PR against MBridge main

#5627 opened Jul 2, 2026 by svcnvidia-nemo-ci • Draft

fix(examples): handle missing SFT chat template community-request

#5626 opened Jul 2, 2026 by fallintoplace

Loading…

Ignore contributor DCO failures in Claude fix Approved

All necessary approvals have been made

complexity: low

#5625 opened Jul 2, 2026 by Phlip79 Member

Loading…

1 task done

DO NOT MERGE: NVLS=1 symmetric-memory CI diagnostic complexity: medium

#5624 opened Jul 2, 2026 by wujingyue Contributor

Loading…

Short-circuit condition to avoid copying from GPU memory in ChainedOptimizer

#5623 opened Jul 2, 2026 by filaretov

Loading…

1 task done

chore: AUT-673 Update Docker image version to 26.06-py3 ci Run functional tests

#5622 opened Jul 2, 2026 by svcnemo-autobot Collaborator • Draft

chore: AUT-672 Update Docker image version to 26.06-py3 ci Run functional tests

#5621 opened Jul 2, 2026 by svcnemo-autobot Collaborator • Draft

[dev] partial cuda graph support for dynamic cp

#5618 opened Jul 2, 2026 by HaochenYuan Contributor

Loading…

6 tasks

Add sequence packing planning API

#5612 opened Jul 2, 2026 by ilml Contributor • Draft

Inference: Add profile endpoints to chat completions.

#5611 opened Jul 1, 2026 by sidsingh-nvidia Contributor • Draft

6 tasks

Callback system complexity: medium

#5610 opened Jul 1, 2026 by maanug-nv Contributor

Loading…

6 tasks

return prefix cache hits data from the chat completions api

#5609 opened Jul 1, 2026 by sidsingh-nvidia Contributor • Draft

1 of 6 tasks

Triton kernels - avoid recompilation and autotuning in prod complexity: low Final Review

PR is in the "final review" stage

#5608 opened Jul 1, 2026 by sidsingh-nvidia Contributor

Loading…

1 of 6 tasks

Inference: Add load aware routing to prefix caching.

#5607 opened Jul 1, 2026 by sidsingh-nvidia Contributor • Draft

1 of 6 tasks

Add VLM support to DynamicInferenceEngine

#5606 opened Jul 1, 2026 by RPrenger Contributor • Draft

6 tasks

Guard nvrx __version__ + degrade async support gracefully for older nvidia-resiliency-ext

#5605 opened Jul 1, 2026 by yeyu-nvidia Contributor • Draft

Previous 1 2 3 4 5 … 23 24 Next

Previous Next

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!