v0.3.8
版本发布时间: 2024-07-24 18:38:57
OpenRLHF/OpenRLHF最新发布版本:v0.4.2(2024-08-29 19:39:57)
Changes
- Default to using
torch.cuda.device_count()
fortp_size
inbatch_inference
@tongyx361 - Improved description of
tqdm
@tongyx361 - Fixed loading dataset from local text files @tongyx361
- Added support for Llama3.1 @xiaoxigua999
- Added
--packing_samples
support for all HF models (SFT/DPO/RM training) @xiaoxigua999 - Added
--nll_loss_coef
(for chosen response) support for DPO @xiaoxigua999