Pull requests: NVIDIA-NeMo/RL

Pull requests list

feat: add draft model support
Labels: community-request, documentation (Improvements or additions to documentation)
#1921 opened Feb 10, 2026 by shaunjoshi Draft
4 tasks
refactor: refactor loss function
#1920 opened Feb 10, 2026 by yuki-97 Draft
fix: fix and re-enable rm env functional test
Labels: CI:L1 (Run doctests, unit tests, and functional tests)
#1905 opened Feb 10, 2026 by RayenTian
chore: bump mcore and mbridge
Labels: CI:L1 (Run doctests, unit tests, and functional tests), super-v3
#1902 opened Feb 9, 2026 by yfw
4 tasks
test: Add script for nemotron test
Labels: super-v3
#1901 opened Feb 9, 2026 by guyueh1
4 tasks
feat: ProRLv2 - add seq-mask-tis truncated importance sampling type
Labels: CI:L1 (Run doctests, unit tests, and functional tests), documentation (Improvements or additions to documentation)
#1899 opened Feb 9, 2026 by hijkzzz
feat: Fuse logprob and train when rollout and train have same batch size
Labels: deepseek (Related to deepseek 671b)
#1891 opened Feb 6, 2026 by guyueh1 Draft
4 tasks
feat: Megatron LoRA GRPO w/ Weight Merging
#1889 opened Feb 5, 2026 by vadam5
4 tasks
feat: MXFP8 rollout support
Labels: super-v3
#1887 opened Feb 5, 2026 by guyueh1 Draft
4 tasks
feat: Support build custom flashinfer
Labels: CI:L2 (Run doctests, unit tests, functional tests, and convergence tests), super-v3
#1886 opened Feb 5, 2026 by guyueh1
4 tasks
feat: retry rollout if generation_logprobs contains NaN
Labels: CI:L2 (Run doctests, unit tests, functional tests, and convergence tests), super-v3
#1885 opened Feb 5, 2026 by guyueh1
4 tasks
fix: Mxfp8 training fix sequence padding
Labels: CI:L2 (Run doctests, unit tests, functional tests, and convergence tests), super-v3
#1884 opened Feb 5, 2026 by guyueh1
4 tasks
feat: Add perfetto tracing for async GRPO training
#1876 opened Feb 4, 2026 by gspschmid
4 tasks
feat: add worker initialization timing collection
Labels: CI:L0 (Run doctests and unit tests), CI:L1 (Run doctests, unit tests, and functional tests)
#1873 opened Feb 4, 2026 by yashaswikarnati
4 tasks
chore: bump torch 2.9.1, vllm 0.15 sglang 0.5.8, ray 2.53
Labels: dependencies (Pull requests that update a dependency file)
#1871 opened Feb 3, 2026 by terrykong
4 tasks
feat: add fault injection utilities for testing fault tolerance
Labels: CI:L0 (Run doctests and unit tests), CI:L1 (Run doctests, unit tests, and functional tests)
#1868 opened Feb 3, 2026 by yashaswikarnati
4 tasks
Mdp
#1849 opened Jan 29, 2026 by shanmugamr1992
4 tasks
Add Muon post-training support
Labels: documentation (Improvements or additions to documentation)
#1848 opened Jan 29, 2026 by ashors1 Draft
4 tasks
feat: Added save_optimizer flag to control saving optimizer or not in checkpointing
Labels: CI:L0 (Run doctests and unit tests), community-request, needs-follow-up (Issue needs follow-up)
#1843 opened Jan 29, 2026 by odedovadia
1 task
feat: enforce monotonicity config option
#1840 opened Jan 29, 2026 by cmunley1
4 tasks