-
Notifications
You must be signed in to change notification settings - Fork 270
Pull requests: NVIDIA-NeMo/RL
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: support top-p top-k in grpo
CI:Lfast
Runs a fast test suite and re-use nightly `main` container (but sync dependencies to PRs version)
docs: add prerequisites, troubleshooting, and build verification for GRPO quickstart
community-request
documentation
Improvements or additions to documentation
#2051
opened Mar 3, 2026 by
brluobt
Loading…
3 tasks
feat: caching gym venvs and apptainer for super branch
documentation
Improvements or additions to documentation
#2050
opened Mar 3, 2026 by
terrykong
Loading…
4 tasks
ci: Allow cancelling of unit tests
CI:docs
Run doctest
CI
Relating to CI
#2045
opened Mar 2, 2026 by
chtruong814
Loading…
4 tasks
chore: bumpup Megatron-Bridge submodule to main
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
Run CICD
#2039
opened Mar 1, 2026 by
ZhiyuLi-Nvidia
Loading…
4 tasks
feat: Add chunked linear ce loss function from hidden states
community-request
#2036
opened Feb 27, 2026 by
pengdurice
Loading…
3 of 4 tasks
chore: address deprecation warning for using a non-tuple sequence for multidimensional indexing
CI:L1
Run doctests, unit tests, and functional tests
feat: split validation statistics by task name
CI:L1
Run doctests, unit tests, and functional tests
documentation
Improvements or additions to documentation
#2019
opened Feb 24, 2026 by
yuki-97
Loading…
feat: adding wandb table log feature, showing concrete test samples
community-request
documentation
Improvements or additions to documentation
#2018
opened Feb 24, 2026 by
vinhngx
Loading…
4 tasks
ci: Test GB200 runner
CI:L1
Run doctests, unit tests, and functional tests
CI
Relating to CI
#2017
opened Feb 24, 2026 by
chtruong814
•
Draft
4 tasks
fix: use smoothed reward metric for VLM GRPO CLEVR convergence tests
CI:docs
Run doctest
CI:L0
Run doctests and unit tests
#2015
opened Feb 23, 2026 by
aroshanghias-nvd
Loading…
3 tasks
perf: Update moe_token_dispatcher_type default to alltoall
CI:L2
Run doctests, unit tests, functional tests, and convergence tests
#2004
opened Feb 22, 2026 by
parthmannan
Loading…
4 tasks
perf: shard concat overhead
community-request
#2002
opened Feb 21, 2026 by
pjo256
Loading…
3 of 4 tasks
feat: prefetch gym venvs
CI:L0
Run doctests and unit tests
documentation
Improvements or additions to documentation
#2000
opened Feb 21, 2026 by
terrykong
Loading…
4 tasks
refit latest update based on ahmads refactor
#1993
opened Feb 19, 2026 by
shanmugamr1992
Loading…
4 tasks
Add is_async to CheckpointingConfig TypedDict
community-request
#1991
opened Feb 19, 2026 by
dmvevents
Loading…
3 tasks
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.