Pull requests: huggingface/trl
Pull requests list
feat: add Gemma/Gemma2 training chat templates with generation markers
#5523
opened Apr 11, 2026 by
ps-abhi
5 of 8 tasks
Drop, don't truncate, overlong tool results in GRPOTrainer
#5521
opened Apr 11, 2026 by
qgallouedec
Member
feat(glm-4-moe): Add {% generation %} markers for training chat template
#5519
opened Apr 10, 2026 by
casinca
Contributor
5 of 8 tasks
Fix supports_tool_calling falsely accepting templates that drop assistant tool_calls
#5517
opened Apr 10, 2026 by
qgallouedec
Member
[WIP] Fix OnlineDPO vLLM server completion handling
#5516
opened Apr 10, 2026 by
JohnGiorgi
Contributor
Draft
5 of 8 tasks
Remove unused dependencies for judges from dev requirements
#5515
opened Apr 10, 2026 by
qgallouedec
Member
Fix CI dependency installs to use a single resolve
#5513
opened Apr 10, 2026 by
qgallouedec
Member
Expose trainer dataset type metadata
#5512
opened Apr 10, 2026 by
JohnGiorgi
Contributor
5 of 8 tasks
Remove xfail condition for Gemma4 response_schema regex bug
#5510
opened Apr 10, 2026 by
qgallouedec
Member
Simplify role handling in prepare_multimodal_messages
#5508
opened Apr 10, 2026 by
albertvillanova
Member
[SSD] Added SSD trainer in experimental
#5505
opened Apr 10, 2026 by
kashif
Collaborator
8 tasks
Fix is_liger_kernel_available compatibility with liger-kernel-nightly
#5478
opened Apr 8, 2026 by
flofiz
3 of 6 tasks
Support messages with images in prepare_multimodal_messages
#5474
opened Apr 8, 2026 by
albertvillanova
Member
Fix the tests related to Flash Attention 2
#5473
opened Apr 8, 2026 by
YangKai0616
Contributor
2 tasks
[docs] Add hardware requirements note to quickstart
#5472
opened Apr 7, 2026 by
pqbas
5 of 8 tasks
[docs] Clarify dtype defaults between trf v5 and TRL
#5457
opened Apr 4, 2026 by
casinca
Contributor
2 of 4 tasks
[AsyncGRPO] Support async tool calls in AsyncRolloutWorker
#5446
opened Apr 3, 2026 by
PoilZero
5 of 8 tasks
feat(async-grpo): add sampling parameter parity
#5418
opened Mar 31, 2026 by
kdubovikov
Contributor
4 of 8 tasks