Pull requests: NVIDIA/Megatron-LM
Fix release tests: remove --global-batch-size conflicting with --step-batch-size-schedule
complexity: low
#4545
opened Apr 30, 2026 by
deepakn94
Contributor
1 task
revert: replace rampup batch size scheduler with custom step batch size schedules (#4411)
complexity: high
#4543
opened Apr 29, 2026 by
ko3n1g
Contributor
docs: use @file-path notation for file references in skills
docs-only
documentation only (docs or docstrings)
#4542
opened Apr 29, 2026 by
ko3n1g
Contributor
ci: add base_sha to codecov/codecov-action upload step
complexity: low
#4540
opened Apr 29, 2026 by
chtruong814
Contributor
•
Queued
5 tasks
Skip cu_seqlens broadcast when packed sequences are not in use
complexity: low
#4536
opened Apr 29, 2026 by
tomlifu
Contributor
5 tasks
mmiranda working on another set of broken links
#4534
opened Apr 29, 2026 by
megnvidia
Contributor
5 tasks
Move policy epoch stats to the message object
complexity: low
Final Review
PR is in the "final review" stage
#4533
opened Apr 29, 2026 by
ArEsKay3
Contributor
5 tasks
ci: Update CI workflow conditions to include merge group handling
#4532
opened Apr 29, 2026 by
balasaajay
Contributor
•
Draft
5 tasks
[Split 1/N of #3430] feat(mHC): basic pytorch implementation of manifold hyper connection
complexity: high
Run functional tests
#4531
opened Apr 29, 2026 by
Connor-XY
3 of 4 tasks
fix: update fine-grained offload assertion for full-iteration cuda gr…
#4525
opened Apr 29, 2026 by
rapatel
Contributor
5 tasks
bump emerging-optimizer package for deepseek v4 coefficients type
complexity: medium
#4523
opened Apr 29, 2026 by
FDecaYed
Contributor
5 tasks
additional tests for nvrx
complexity: low
Expert Review
[deprecated] Apply this label to indicate that your PR is ready for expert review.
#4522
opened Apr 29, 2026 by
dimapihtar
Contributor
5 tasks
Allow optimizer CG to share the same pool as full-iter CG
#4521
opened Apr 29, 2026 by
nanz-nv
Contributor
5 tasks
[dev] [DeepSeek-v4] Part 3: MTP support with mHC and new mHC contract
dev branch
Dev branch related issues and development
Add a knob to throttle the max allowed inflight offload in fine grained offloading
#4514
opened Apr 29, 2026 by
nanz-nv
Contributor
5 tasks
fix: update fine-grained offload assertion for full-iteration cuda gr…
#4513
opened Apr 29, 2026 by
rapatel
Contributor
5 tasks