Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add Gemma/Gemma2 training chat templates with generation markers
#5523 opened Apr 11, 2026 by ps-abhi Loading…
5 of 8 tasks
Drop, don't truncate, overlong tool results in GRPOTrainer
#5521 opened Apr 11, 2026 by qgallouedec Member Loading…
Fix add_response_schema for VLM processors
#5520 opened Apr 11, 2026 by qgallouedec Member Loading…
feat(glm-4-moe): Add {% generation %} markers for training chat template
#5519 opened Apr 10, 2026 by casinca Contributor Loading…
5 of 8 tasks
Add LLaMA 3.1 and 3.2 tool calling support
#5518 opened Apr 10, 2026 by qgallouedec Member Loading…
[WIP] Fix OnlineDPO vLLM server completion handling
#5516 opened Apr 10, 2026 by JohnGiorgi Contributor Draft
5 of 8 tasks
Remove unused dependencies for judges from dev requirements
#5515 opened Apr 10, 2026 by qgallouedec Member Loading…
Fix CI dependency installs to use a single resolve
#5513 opened Apr 10, 2026 by qgallouedec Member Loading…
Expose trainer dataset type metadata
#5512 opened Apr 10, 2026 by JohnGiorgi Contributor Loading…
5 of 8 tasks
Remove xfail condition for Gemma4 response_schema regex bug
#5510 opened Apr 10, 2026 by qgallouedec Member Loading…
Simplify role handling in prepare_multimodal_messages
#5508 opened Apr 10, 2026 by albertvillanova Member Loading…
[TPO] experimental TPO trainer
#5506 opened Apr 10, 2026 by kashif Collaborator Loading…
8 tasks
[SSD] Added SSD trainer in experimental
#5505 opened Apr 10, 2026 by kashif Collaborator Loading…
8 tasks
Set _tokenizer as trainer attribute
#5489 opened Apr 9, 2026 by albertvillanova Member Loading…
Deprecate eos_token config parameter
#5481 opened Apr 9, 2026 by albertvillanova Member Loading…
Fix is_liger_kernel_available compatibility with liger-kernel-nightly
#5478 opened Apr 8, 2026 by flofiz Loading…
3 of 6 tasks
Fix the tests related to Flash Attention 2
#5473 opened Apr 8, 2026 by YangKai0616 Contributor Loading…
2 tasks
[docs] Add hardware requirements note to quickstart
#5472 opened Apr 7, 2026 by pqbas Loading…
5 of 8 tasks
GOLDTrainer VLM support
#5461 opened Apr 6, 2026 by Strongich Loading…
4 of 8 tasks
[docs] Clarify dtype defaults between trf v5 and TRL
#5457 opened Apr 4, 2026 by casinca Contributor Loading…
2 of 4 tasks
[AsyncGRPO] Support async tool calls in AsyncRolloutWorker
#5446 opened Apr 3, 2026 by PoilZero Loading…
5 of 8 tasks
FIPO loss
#5434 opened Apr 2, 2026 by kdubovikov Contributor Loading…
4 of 8 tasks
feat(async-grpo): add sampling parameter parity
#5418 opened Mar 31, 2026 by kdubovikov Contributor Loading…
4 of 8 tasks
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.