v0.2.0
版本发布时间: 2023-01-25 22:04:19
huggingface/trl最新发布版本:v0.11.1(2024-09-25 00:13:05)
Highlights
- General decoder model support in addition to GPT-2 in https://github.com/lvwerra/trl/pull/53
- Encoder-decoder model support (such as T5) in https://github.com/lvwerra/trl/pull/93
- New, shiny docs with the
doc-builder
in https://github.com/lvwerra/trl/pull/59 -
push_to_hub
with PPOTrainer in https://github.com/lvwerra/trl/pull/68 - Simple reference model creation with layer sharing in https://github.com/lvwerra/trl/pull/61
What's Changed
- Remove
nbdev
dependency by @younesbelkada in https://github.com/lvwerra/trl/pull/52 - Adds github actions and dummy test by @edbeeching in https://github.com/lvwerra/trl/pull/55
- Update README.md by @Keith-Hon in https://github.com/lvwerra/trl/pull/51
- Update README.md by @TristanThrush in https://github.com/lvwerra/trl/pull/49
- Adds Python highlighting to the code block by @JulesGM in https://github.com/lvwerra/trl/pull/45
-
xxxForCausalLM
support by @younesbelkada in https://github.com/lvwerra/trl/pull/53 - [
VHead
] Fix slow convergence issue by @younesbelkada in https://github.com/lvwerra/trl/pull/60 - add docbuilder skeleton by @lvwerra in https://github.com/lvwerra/trl/pull/59
- fix docs workflow by @lvwerra in https://github.com/lvwerra/trl/pull/63
-
accelerate
integration by @younesbelkada in https://github.com/lvwerra/trl/pull/58 - add create_reference_model by @lvwerra in https://github.com/lvwerra/trl/pull/61
- Improve Makefile and code quality by @lvwerra in https://github.com/lvwerra/trl/pull/62
- Relax requirements by @lvwerra in https://github.com/lvwerra/trl/pull/66
- modeling - change namings by @younesbelkada in https://github.com/lvwerra/trl/pull/65
- [
PPOTrainer
] make the reference model optional by @younesbelkada in https://github.com/lvwerra/trl/pull/67 - Improvements 1a by @edbeeching in https://github.com/lvwerra/trl/pull/70
- update GitHub actions to
main
by @lvwerra in https://github.com/lvwerra/trl/pull/77 - [core] refactor
step
method by @younesbelkada in https://github.com/lvwerra/trl/pull/76 - [
PPOTrainer
] Support generic optimizers by @younesbelkada in https://github.com/lvwerra/trl/pull/78 - Update sentiment_tuning.mdx by @eltociear in https://github.com/lvwerra/trl/pull/69
- Remove references to "listify_batch" by @xiaoyesoso in https://github.com/lvwerra/trl/pull/81
- Collater -> collator by @LysandreJik in https://github.com/lvwerra/trl/pull/88
- Model as kwarg in pipeline by @LysandreJik in https://github.com/lvwerra/trl/pull/89
- Small typo correction by @LysandreJik in https://github.com/lvwerra/trl/pull/87
- [API] Make
dataset
attribute optional by @younesbelkada in https://github.com/lvwerra/trl/pull/85 - [Doc] Improve docs by @younesbelkada in https://github.com/lvwerra/trl/pull/91
- [core] Push
v_head
when usingAutoModelForCausalLMWithValueHead
by @younesbelkada in https://github.com/lvwerra/trl/pull/86 - [core] remove
wandb
dependency by @younesbelkada in https://github.com/lvwerra/trl/pull/92 - add logo by @lvwerra in https://github.com/lvwerra/trl/pull/95
- Encoder-Decoder models support by @younesbelkada in https://github.com/lvwerra/trl/pull/93
- Fix docs hyperlinks by @lewtun in https://github.com/lvwerra/trl/pull/98
- [API] LR scheduler support by @younesbelkada in https://github.com/lvwerra/trl/pull/96
- Version should have
dev0
unless it is a release version by @mishig25 in https://github.com/lvwerra/trl/pull/99 - [core] improve API by @younesbelkada in https://github.com/lvwerra/trl/pull/97
- Add push to Hub for PPOTrainer by @lewtun in https://github.com/lvwerra/trl/pull/68
- [
core
] Advise to usefbs=1
by @younesbelkada in https://github.com/lvwerra/trl/pull/102 - [Doc] New additions by @younesbelkada in https://github.com/lvwerra/trl/pull/105
- restructure examples by @lvwerra in https://github.com/lvwerra/trl/pull/107
- Fix nits & missing things by @younesbelkada in https://github.com/lvwerra/trl/pull/108
- Convert notebook 05 by @edbeeching in https://github.com/lvwerra/trl/pull/80
New Contributors
- @lvwerra made their first contribution in https://github.com/lvwerra/trl/pull/2
- @vblagoje made their first contribution in https://github.com/lvwerra/trl/pull/16
- @dependabot made their first contribution in https://github.com/lvwerra/trl/pull/26
- @younesbelkada made their first contribution in https://github.com/lvwerra/trl/pull/52
- @edbeeching made their first contribution in https://github.com/lvwerra/trl/pull/55
- @Keith-Hon made their first contribution in https://github.com/lvwerra/trl/pull/51
- @TristanThrush made their first contribution in https://github.com/lvwerra/trl/pull/49
- @JulesGM made their first contribution in https://github.com/lvwerra/trl/pull/45
- @eltociear made their first contribution in https://github.com/lvwerra/trl/pull/69
- @xiaoyesoso made their first contribution in https://github.com/lvwerra/trl/pull/81
- @LysandreJik made their first contribution in https://github.com/lvwerra/trl/pull/88
- @lewtun made their first contribution in https://github.com/lvwerra/trl/pull/98
- @mishig25 made their first contribution in https://github.com/lvwerra/trl/pull/99
Full Changelog: https://github.com/lvwerra/trl/commits/v0.2.0