v0.1.3
版本发布时间: 2024-01-08 19:37:53
OpenRLHF/OpenRLHF最新发布版本:v0.4.2(2024-08-29 19:39:57)
Changes
- Fixed Huggingface Reward model saving
- Improved
mask_mean
for loss function - Fixed
num_actions
andaction_mask
- Optimized PPO performance of example scripts (set micro_batch_size=4)