MyGit

v0.8.2

huggingface/trl

版本发布时间: 2024-04-11 21:51:28

huggingface/trl最新发布版本:v0.11.1(2024-09-25 00:13:05)

ORPO Trainer & Vision LLMs support for SFTTrainer, KTO fixes

This release includes two new trainers: ORPO from KAIST and CPO
The release also includes Vision LLM such as Llava support for SFTTrainer, please see: https://github.com/huggingface/trl/blob/main/examples/scripts/vsft_llava.py for more details

ORPO Trainer

CPO Trainer

VLLMs support for SFTTrainer

You can now use SFTTrainer to fine-tune VLLMs such as Llava ! See: https://github.com/huggingface/trl/blob/main/examples/scripts/vsft_llava.py for more details

KTO Fixes

Many fixes were introduced for the KTOTrainer:

10x PPO !

Other fixes

New Contributors

Full Changelog: https://github.com/huggingface/trl/compare/v0.8.1...v0.8.2

相关地址:原始地址 下载(tar) 下载(zip)

查看:2024-04-11发行的版本