v0.4.0

Release date: 2023-03-09 19:38:59

Latest huggingface/trl release: v0.11.1 (2024-09-25 00:13:05)

`v0.4.0`: `peft` integration

Apply RLHF and fine-tune your favorite large model on a consumer GPU using `peft` and `trl`! You can also easily share your trained RLHF adapters on the Hub with a few lines of code.

With this integration you can train gpt-neo-x (a 20B-parameter model, about 40GB in bfloat16) on a 24GB consumer GPU!
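
The 40GB figure follows from simple arithmetic: 20 billion parameters at 2 bytes each in bfloat16. The sketch below reproduces that number and, purely as an illustrative assumption, shows what the same weights would occupy at 1 byte per parameter. The release note itself does not state which precision is used to fit the model on a 24GB card, and the helper name `weight_memory_gb` is hypothetical. The estimate covers weights only and ignores activations, optimizer state, and adapter parameters.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes."""
    return num_params * bytes_per_param / 1e9


NUM_PARAMS = 20e9  # gpt-neo-x parameter count, as given in the release note

# bfloat16 stores each parameter in 2 bytes.
bf16_gb = weight_memory_gb(NUM_PARAMS, 2)

# Assumption for illustration only: 1 byte per parameter (e.g. 8-bit weights).
int8_gb = weight_memory_gb(NUM_PARAMS, 1)

print(f"bfloat16 weights: {bf16_gb:.0f} GB")
print(f"8-bit weights (assumed): {int8_gb:.0f} GB")
```

Under these assumptions the bfloat16 weights alone exceed a 24GB card, which is why a memory-saving setup such as training small adapters on top of a compressed base model is needed.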

What's Changed

Allow running evaluate-toxicity with cpu by @jordimas in https://github.com/lvwerra/trl/pull/195
[core] Fix quality issue by @younesbelkada in https://github.com/lvwerra/trl/pull/197
Add 1.12.1 torch compatibility in sum method by @PanchenkoYehor in https://github.com/lvwerra/trl/pull/190
peft integration by @edbeeching in https://github.com/lvwerra/trl/pull/163
[core] Update dependency by @younesbelkada in https://github.com/lvwerra/trl/pull/206

New Contributors

@PanchenkoYehor made their first contribution in https://github.com/lvwerra/trl/pull/190

Full Changelog: https://github.com/lvwerra/trl/compare/v0.3.1...v0.4.0
