Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Remove extra whitespaces
#1647 opened May 17, 2024 by qgallouedec Draft
Prototype Dataset Processor
#1646 opened May 16, 2024 by vwxyzjn Draft
[DRAFT] Vllm integration
#1628 opened May 7, 2024 by vwxyzjn Draft
Integrate f-divergence to DPO (Follow up)
#1610 opened May 1, 2024 by 1485840691 Loading…
Adds Online DPO
#1605 opened Apr 30, 2024 by edbeeching Draft
Minimal examples
#1603 opened Apr 30, 2024 by vwxyzjn Draft
[WIP] Add WinRateCallback
#1598 opened Apr 29, 2024 by lewtun Draft
2 of 5 tasks
🤫 TR-DPO implementation
#1593 opened Apr 26, 2024 by syrn1k Loading…
Added DataCollatorForMultiTurnCompletions
#1592 opened Apr 26, 2024 by AswanthManoj Loading…
[WIP] Unify Policy Trainers
#1586 opened Apr 25, 2024 by lapp0 Draft
4 tasks
Added Reward Backpropogation Support
#1585 opened Apr 25, 2024 by mihirp1998 Loading…
A pull request for POVIDTrainer
#1573 opened Apr 23, 2024 by gzcch Loading…
Apply deprecated evaluation_strategy
#1559 opened Apr 18, 2024 by muellerzr Loading…
PPO / Reinforce Trainers
#1540 opened Apr 15, 2024 by vwxyzjn Loading…
Adds reward bootstrapping to PPOTrainer
#1536 opened Apr 13, 2024 by ejmejm Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.