-
Notifications
You must be signed in to change notification settings - Fork 284
Pull requests: NVIDIA-NeMo/Megatron-Bridge
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Megatron-FSDP] Fix segmentation fault when using precision-aware optimizers with the updated MCore.
#3510
opened Apr 24, 2026 by
cspades
Contributor
Loading…
5 tasks
[docs] feat: document min GPU count for combined dense + MoE parallelism
#3509
opened Apr 24, 2026 by
cuichenx
Contributor
Loading…
2 tasks
[ci] chore: bump .dev.commit to latest mcore-dev tested SHA
docs-only
With great power comes great responsibility.
#3507
opened Apr 23, 2026 by
cuichenx
Contributor
Loading…
1 task
[vulnops][ckpt] fix: Use weights_only=True in TrainState checkpoint loading
full-test-suite
#3506
opened Apr 23, 2026 by
yaoyu-33
Contributor
Loading…
3 of 4 tasks
[ci, recipe] test: add L1 functional coverage for qwen3_vl forward step
needs-more-tests
Requires additional L0 and L1 test coverage before merge
#3502
opened Apr 23, 2026 by
cuichenx
Contributor
Loading…
1 task
chore(beep boop 🤖): Bump Dependencies, packaging, images, and environment setup
ci
CI, automation, test queue, or workflow infrastructure work
full-test-suite
uv.lock (main, mcore-dev) (2026-04-23)
area:build
#3496
opened Apr 23, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
chore(beep boop 🤖): Bump Dependencies, packaging, images, and environment setup
ci
CI, automation, test queue, or workflow infrastructure work
full-test-suite
uv.lock (r0.4.0, mcore-core_r0.17.0) (2026-04-23)
area:build
#3495
opened Apr 23, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
auto-review PR's- Claude
area:misc
Cross-cutting utilities, logging, helpers, and other changes
ci
CI, automation, test queue, or workflow infrastructure work
needs-review
PR is ready for code review and waiting on a reviewer
#3494
opened Apr 23, 2026 by
malay-nagda
Contributor
Loading…
5 tasks
cp: Performance optimizations and benchmarking
bug
Something isn't working
cherry-pick
needs-review
PR is ready for code review and waiting on a reviewer
Run CICD
no recompute default (3470) into r0.4.0
area:perf
#3491
opened Apr 23, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
Use HybridEP flex dispatcher for Qwen3 235B B300 perf configs
area:perf
Performance optimizations and benchmarking
feature
New capabilities, enhancements, or enablement work
#3490
opened Apr 23, 2026 by
rhmukundan
Contributor
•
Draft
[training] fix: guard TECudaGraphHelper.delete_cuda_graphs() with graphs_created()
area:training
Training loop, callbacks, and runtime integration
bug
Something isn't working
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
#3488
opened Apr 22, 2026 by
yaoyu-33
Contributor
Loading…
2 tasks
[data] fix: raise on missing {% generation %} in chat template
area:data
Dataset builders, preprocessing, and samplers
bug
Something isn't working
full-test-suite
needs-review
PR is ready for code review and waiting on a reviewer
#3486
opened Apr 22, 2026 by
yaoyu-33
Contributor
Loading…
4 tasks
[ci] fix: re-enable FSDP checkpoint test after TE 2.14 fix
area:training
Training loop, callbacks, and runtime integration
ci
CI, automation, test queue, or workflow infrastructure work
#3485
opened Apr 22, 2026 by
yaoyu-33
Contributor
Loading…
1 task
fix: pass total_tokens for SSM seq_idx in packed sequences
area:training
Training loop, callbacks, and runtime integration
bug
Something isn't working
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
#3484
opened Apr 22, 2026 by
yaoyu-33
Contributor
Loading…
3 tasks
[vulnops][data] fix: Validate URLs in VLM video loader to prevent SSRF
area:model
Model implementations and HF bridge logic
bug
Something isn't working
needs-review
PR is ready for code review and waiting on a reviewer
#3482
opened Apr 22, 2026 by
yaoyu-33
Contributor
Loading…
11 tasks done
[recipe] feat: enable THD packing by default for Qwen3.5-VL finetune
area:recipe
Training recipes and launch configs
feature
New capabilities, enhancements, or enablement work
cp: Training loop, callbacks, and runtime integration
bug
Something isn't working
cherry-pick
needs-review
PR is ready for code review and waiting on a reviewer
Run CICD
Cleanup TE cuda graphs with the right api (3459) into r0.4.0
area:training
#3476
opened Apr 22, 2026 by
svcnvidia-nemo-ci
Contributor
Loading…
Explicit error when the number of training samples < global batch size
area:data
Dataset builders, preprocessing, and samplers
community-request
feature
New capabilities, enhancements, or enablement work
needs-review
PR is ready for code review and waiting on a reviewer
#3464
opened Apr 22, 2026 by
OlegSudakov
Contributor
Loading…
5 tasks
[model] fix: pass CPU initialization flag
area:model
Model implementations and HF bridge logic
bug
Something isn't working
community-request
needs-more-tests
Requires additional L0 and L1 test coverage before merge
needs-review
PR is ready for code review and waiting on a reviewer
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
#3462
opened Apr 22, 2026 by
pavelgein
Contributor
Loading…
1 of 4 tasks
[Bugfix] MiniMax-M2 full-dimension k_norm sharding when tp_size > num_kv_heads
area:model
Model implementations and HF bridge logic
bug
Something isn't working
community-request
needs-review
PR is ready for code review and waiting on a reviewer
#3458
opened Apr 21, 2026 by
HollowMan6
Contributor
Loading…
2 of 5 tasks
[examples] Rename diffusion/recipes to diffusion/models
area:diffusion
DFM module
feature
New capabilities, enhancements, or enablement work
needs-author
Author action is required before review or merge can continue
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
#3455
opened Apr 21, 2026 by
cuichenx
Contributor
Loading…
[ckpt] fix: use MSC paths
area:ckpt
Checkpoint conversion, loading, export, and save paths
bug
Something isn't working
community-request
needs-review
PR is ready for code review and waiting on a reviewer
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
waiting-on-customer
Waiting on the original author to respond
#3452
opened Apr 21, 2026 by
pavelgein
Contributor
Loading…
1 of 5 tasks
upstream MCore tokenizers config
area:training
Training loop, callbacks, and runtime integration
feature
New capabilities, enhancements, or enablement work
#3451
opened Apr 21, 2026 by
dimapihtar
Contributor
•
Draft
5 tasks
[PEFT] fix: fused fc1 projection mapping for MiniMax (Parameter-efficient fine-tuning (LoRA, adapters)
bug
Something isn't working
community-request
needs-review
PR is ready for code review and waiting on a reviewer
ready-to-merge
PR is approved, current, and only waiting for CI to pass before merge
w1 & w3)
area:peft
#3449
opened Apr 21, 2026 by
HollowMan6
Contributor
Loading…
2 of 5 tasks
Previous Next
ProTip!
Follow long discussions with comments:>50.