v0.8.0
Released: 2024-06-08 06:26:46
Latest release of hiyouga/LLaMA-Factory: v0.9.0 (2024-09-09 01:14:03)
## Stronger LlamaBoard 💪😀
- Support single-node distributed training in Web UI
- Add dropdown menu for easily resuming from checkpoints and picking saved configurations by @hiyouga and @hzhaoy in #4053
- Support selecting checkpoints of full/freeze tuning
- Add throughput metrics to LlamaBoard by @injet-zhou in #4066
- Faster UI loading
## New features
- Add KTO algorithm by @enji-zhou in #3785
- Add SimPO algorithm by @hiyouga
- Support passing `max_lora_rank` to the vLLM backend by @jue-jue-zi in #3794
- Support preference datasets in sharegpt format and remove big files from git repo by @hiyouga in #3799
- Support setting system messages in CLI inference by @ycjcl868 in #3812
- Add `num_samples` option in `dataset_info.json` by @seanzhang-zhichen in #3829
- Add NPU docker image by @dongdongqiang2018 in #3876
- Improve NPU document by @MengqingCao in #3930
- Support SFT packing with greedy knapsack algorithm by @AlongWY in #4009
- Add `llamafactory-cli env` for bug reports
- Support image input in the API mode
- Support random initialization via the `train_from_scratch` argument
- Initialize CI
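The SimPO algorithm added above optimizes a reference-free pairwise objective: it compares length-normalized log-likelihoods of the chosen and rejected responses, shifted by a target reward margin. The sketch below illustrates that loss in plain Python; it is not LLaMA-Factory's implementation, and the function name and default hyperparameters are illustrative only.

```python
import math

def simpo_loss(logp_chosen: float, len_chosen: int,
               logp_rejected: float, len_rejected: int,
               beta: float = 2.0, gamma: float = 0.5) -> float:
    """SimPO pairwise loss: -log sigmoid of the length-normalized
    log-likelihood margin, shifted by a target margin gamma.
    No reference model is needed."""
    reward_chosen = beta * logp_chosen / len_chosen
    reward_rejected = beta * logp_rejected / len_rejected
    margin = reward_chosen - reward_rejected - gamma
    # -log(sigmoid(margin)) == log(1 + exp(-margin)), computed stably
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))

# Chosen answer has higher per-token likelihood -> small loss
loss = simpo_loss(logp_chosen=-10.0, len_chosen=20,
                  logp_rejected=-30.0, len_rejected=20)
```

Dividing by the response length is what removes the need for a frozen reference model: the per-token average log-probability itself serves as the implicit reward.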
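SFT packing with a greedy knapsack algorithm concatenates several short training sequences into one example to reduce padding waste. A minimal first-fit-decreasing sketch of the idea is shown below; the function name is hypothetical, it operates on sequence lengths only, and it assumes every sequence fits within the cutoff, so it is an illustration rather than the project's actual code.

```python
def greedy_knapsack_pack(lengths: list[int], cutoff: int) -> list[list[int]]:
    """Pack sequence lengths into bins of capacity `cutoff` using
    first-fit decreasing: visit lengths longest-first, place each into
    the first bin with enough room, and open a new bin when none fits."""
    bins: list[list[int]] = []
    space: list[int] = []  # remaining capacity per bin
    for length in sorted(lengths, reverse=True):
        for i, free in enumerate(space):
            if length <= free:
                bins[i].append(length)
                space[i] -= length
                break
        else:
            bins.append([length])
            space.append(cutoff - length)
    return bins

packed = greedy_knapsack_pack([5, 3, 7, 2, 6, 1], cutoff=8)
# -> [[7, 1], [6, 2], [5, 3]]
```

Sorting longest-first lets short sequences fill the leftover space in already-opened bins, so fewer pad tokens are needed per batch than with naive sequential packing.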
## New models
- Base models
  - Qwen2 (0.5B/1.5B/7B/72B/MoE) 📄
  - PaliGemma-3B (pt/mix) 📄🖼️
  - GLM-4-9B 📄
  - Falcon-11B 📄
  - DeepSeek-V2-Lite (16B) 📄
- Instruct/Chat models
  - Qwen2-Instruct (0.5B/1.5B/7B/72B/MoE) 📄🤖
  - Mistral-7B-Instruct-v0.3 📄🤖
  - Phi-3-small-8k-instruct (7B) 📄🤖
  - Aya-23 (8B/35B) 📄🤖
  - OpenChat-3.6-8B 📄🤖
  - GLM-4-9B-Chat 📄🤖
  - TeleChat-12B-Chat by @hzhaoy in #3958 📄🤖
  - Phi-3-medium-8k-instruct (14B) 📄🤖
  - DeepSeek-V2-Lite-Chat (16B) 📄🤖
  - Codestral-22B-v0.1 📄🤖
## New datasets
- Pre-training datasets
  - FineWeb (en)
  - FineWeb-Edu (en)
- Supervised fine-tuning datasets
  - Ruozhiba-GPT4 (zh)
  - STEM-Instruction (zh)
- Preference datasets
  - Argilla-KTO-mix-15K (en)
  - UltraFeedback (en)
## Bug fixes
- Fix RLHF for multimodal finetuning
- Fix LoRA target in multimodal finetuning by @BUAADreamer in #3835
- Fix `yi` template by @Yimi81 in #3925
- Fix abort issue in LlamaBoard by @injet-zhou in #3987
- Pass `scheduler_specific_kwargs` to `get_scheduler` by @Uminosachi in #4006
- Fix hyperparameter help messages by @xu-song in #4007
- Update issue template by @statelesshz in #4011
- Fix vLLM dtype parameter
- Fix exporting hyperparameters by @MengqingCao in #4080
- Fix DeepSpeed ZeRO3 in PPO trainer
- Fix #3108 #3387 #3646 #3717 #3764 #3769 #3803 #3807 #3818 #3837 #3847 #3853 #3873 #3900 #3931 #3965 #3971 #3978 #3992 #4005 #4012 #4013 #4022 #4033 #4043 #4061 #4075 #4077 #4079 #4085 #4090 #4120 #4132 #4137 #4139