v0.4.1
版本发布时间: 2024-08-07 19:37:10
OpenRLHF/OpenRLHF最新发布版本:v0.4.2(2024-08-29 19:39:57)
What's Changed
- Rename wandb args in scripts by @coding-famer in https://github.com/OpenRLHF/OpenRLHF/pull/396
- Speed Up Data Processing by Using Multi-Processing in Dataset.map by @Ricardokevins and @xiaoxigua999 in https://github.com/OpenRLHF/OpenRLHF/pull/412
- Update link to code in readme by @coding-famer in https://github.com/OpenRLHF/OpenRLHF/pull/414
- Fixed
input_template
for Iterative DPO and Rejection Sampling @xiaoxigua999 - Fixed
SFTDataset
for Continue Pretrain @xiaoxigua999
New Contributors
- @coding-famer made their first contribution in https://github.com/OpenRLHF/OpenRLHF/pull/396
- @Ricardokevins made their first contribution in https://github.com/OpenRLHF/OpenRLHF/pull/412
Full Changelog: https://github.com/OpenRLHF/OpenRLHF/compare/v0.4.0...v0.4.1