Pull requests: huggingface/trl

Pull requests list

Adjust padding in batch generation
#2251 opened Oct 18, 2024 by gaetanlop
3 tasks done
Conversational dataset support for KTOTrainer
#2248 opened Oct 18, 2024 by qgallouedec
5 tasks
Data mixer Integration
#2240 opened Oct 16, 2024 by August-murr Draft
3 of 5 tasks
[online-DPO] evaluation step error (label: 🐛 bug)
#2231 opened Oct 15, 2024 by kashif
Refactor DPO data processing
#2209 opened Oct 9, 2024 by qgallouedec
5 tasks
Add VAS to TRL (label: ✨ enhancement)
#2195 opened Oct 7, 2024 by idanshen
[CGPO] CGPO Trainer (single task single objective) (label: ✨ enhancement)
#2190 opened Oct 6, 2024 by gaetanlop Draft
9 of 11 tasks
Change KTO tokenization to use DPO's (label: 🏋 KTO)
#2187 opened Oct 6, 2024 by kawine
[CGPO] Mixture of judges (label: 👨‍⚖️ judge)
#2159 opened Oct 3, 2024 by gaetanlop
4 tasks done
Remove graph breaks for torch.compile() in padding free branch in DataCollatorForCompletionOnlyLM (labels: 🐛 bug, 🏋 SFT)
#2158 opened Oct 3, 2024 by Abhishek-TAMU
1 of 5 tasks
populate SUPPORTED_COMMANDS cli
#2157 opened Oct 2, 2024 by grumpyp
4 of 5 tasks
[Open discussion] Multistep dataset
#2148 opened Oct 1, 2024 by qgallouedec Draft
4 tasks
DPO trainer supports num_logits_to_keep to save memory (label: 🏋 DPO)
#2129 opened Sep 26, 2024 by xyangk
3 of 5 tasks
Process-supervised RM Trainer
#2127 opened Sep 26, 2024 by gaetanlop Draft
5 tasks done
[SCoRE] initial score stage 1
#2115 opened Sep 24, 2024 by kashif Draft
Remove deprecated args in trainers
#2036 opened Sep 8, 2024 by qgallouedec Draft
5 tasks
feat: add support for packing tokenized datasets
#2011 opened Sep 3, 2024 by kmehant
3 of 5 tasks
allow masking on consecutive messages with same roles
#2000 opened Aug 31, 2024 by lsy641
4 of 5 tasks