v0.3.7
版本发布时间: 2024-07-20 19:01:32
OpenRLHF/OpenRLHF最新发布版本:v0.4.2(2024-08-29 19:39:57)
Changes
- Added support for
--packing_samples
in DPO/RM training (@xiaoxigua999) - Updated
reward_dataset
to correctly handleprompt_key
(@Nickydusk) - Updated versions of Transformers and DeepSpeed (@openllmai0)