Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Clean up model update group on worker exit
#5325 opened Mar 20, 2026 by AmineDiro Loading…
5 tasks
(5/5) async grpo metrics
#5322 opened Mar 20, 2026 by AmineDiro Loading…
(3/5) Cancel Stale inflight tasks
#5320 opened Mar 20, 2026 by AmineDiro Loading…
Expand the list of attention implementations compatible with packing
#5316 opened Mar 19, 2026 by mariosasko Loading…
1 of 5 tasks
docs: clarify PPO entropy metrics in PPO trainer docs
#5289 opened Mar 14, 2026 by biefan Loading…
Add reference to DeepSeekMath in accuracy_reward docstring
#5287 opened Mar 13, 2026 by qgallouedec Loading…
5 tasks
Add Cursor rules from AGENTS.md
#5280 opened Mar 12, 2026 by qgallouedec Loading…
Add Nemotron 3 to tests via tiny model
#5278 opened Mar 12, 2026 by sergiopaniego Loading…
5 tasks
Centralize AI agent templates in .ai
#5268 opened Mar 10, 2026 by qgallouedec Loading…
async streaming grpo w prefetch
#5250 opened Mar 9, 2026 by winglian Loading…
5 tasks
Update openenv examples to use environment_factory
#5235 opened Mar 6, 2026 by sergiopaniego Loading…
3 of 8 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.