MyGit

v0.7.11

huggingface/trl

版本发布时间: 2024-02-16 16:22:47

huggingface/trl最新发布版本:v0.11.1(2024-09-25 00:13:05)

DPO important fixes

We fixed issues with respect to IPO loss, leading to consistent results according to newest experiements:

We also fixed important bugs with respect to DPO / PEFT and Flash Attention

Data processing is now faster for multi-GPU envs

Other DPO bugfixes:

Faster data processing and other enhancements:

Automatic tagging for all models

Models now gets tagged correctly even if users do not call trainer.push_to_hub()

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/trl/compare/v0.7.10...v0.7.11

相关地址:原始地址 下载(tar) 下载(zip)

查看:2024-02-16发行的版本