v0.3.2

版本发布时间: 2024-07-02 03:05:50

OpenRLHF/OpenRLHF最新发布版本:v0.4.2(2024-08-29 19:39:57)

Changes

Fixed max_model_len @openllmai0
Added support for tokenizer chat templates in train_rm.py and reward_dataset.py. @mickelliu
Introduced the [--enable_prefix_caching] (https://github.com/OpenLLMAI/OpenRLHF/commit/bbbae8352cdbf26f990885a4e1640e82d0fbeaa8) option. @openllmai0
Added support for saving the value network with [--save_value_network] (https://github.com/OpenLLMAI/OpenRLHF/commit/e24c53fa2352964cd4638e8ceeede5bd0c6f47ce). @openllmai0
Supported specifying the number of samples per prompt with --n_samples_per_prompt. @openllmai0