v0.4.0
版本发布时间: 2024-07-31 15:40:41
OpenRLHF/OpenRLHF最新发布版本:v0.4.2(2024-08-29 19:39:57)
Changes
- Added support for checkpointing, including states for Optimizer, Model, Scheduler, and DataLoader. @xiaoxigua999
- Added support for the Remote Reward Model. @catqaq @xiaoxigua999
- Set
add_special_tokens=False
in the tokenizer. @xiaoxigua999 @ZhaofengWu - Added
learning rate
in the logs @xiaoxigua999