v0.3.2
版本发布时间: 2024-07-02 03:05:50
OpenRLHF/OpenRLHF最新发布版本:v0.4.2(2024-08-29 19:39:57)
Changes
- Fixed max_model_len @openllmai0
- Added support for tokenizer chat templates in train_rm.py and reward_dataset.py. @mickelliu
- Introduced the [--enable_prefix_caching] (https://github.com/OpenLLMAI/OpenRLHF/commit/bbbae8352cdbf26f990885a4e1640e82d0fbeaa8) option. @openllmai0
- Added support for saving the value network with [--save_value_network] (https://github.com/OpenLLMAI/OpenRLHF/commit/e24c53fa2352964cd4638e8ceeede5bd0c6f47ce). @openllmai0
- Supported specifying the number of samples per prompt with --n_samples_per_prompt. @openllmai0