MyGit

v0.7.5

huggingface/trl

版本发布时间: 2023-12-22 21:09:41

huggingface/trl最新发布版本:v0.11.1(2024-09-25 00:13:05)

IPO & KTO & cDPO loss, DPOTrainer enhancements, automatic tags for xxxTrainer

Important enhancements for DPOTrainer

This release introduces many new features in TRL for DPOTrainer:

Automatic xxxTrainer tagging on the Hub

Now, trainers from TRL pushes automatically tags trl-sft, trl-dpo, trl-ddpo when pushing models on the Hub

unsloth 🤝 TRL

We encourage users to try out unsloth library for faster LLM fine-tuning using PEFT & TRL's SFTTrainer and DPOTrainer

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/trl/compare/v0.7.4...v0.7.5

相关地址:原始地址 下载(tar) 下载(zip)

查看:2023-12-22发行的版本