v1.7.0
Release date: 2024-03-09 15:54:03
Latest modelscope/ms-swift release: v2.5.0 (2024-10-10 10:21:04)
New Features:
- Added support for swift export, enabling awq-int4 quantization and gptq-int2,3,4,8 quantization. Quantized models can be pushed to the ModelScope Hub. You can view the documentation here.
- Enabled fine-tuning of awq quantized models.
- Enabled fine-tuning of aqlm quantized models.
- Added support for deploying LLM with infer_backend='pt'.
- Added web-ui with task management and visualization of training loss, eval loss, etc. Inference is accelerated using vLLM.
New Tuners:
- LoRA+.
- LlamaPro.
New Models:
- qwen1.5 awq series.
- gemma series.
- yi-9b.
- deepseek-math series.
- internlm2-1_8b series.
- openbuddy-mixtral-moe-7b-chat.
- llama2 aqlm series.
New Datasets:
- ms-bench-mini.
- hh-rlhf-cn series.
- disc-law-sft-zh, disc-med-sft-zh.
- pileval.
What's Changed
- Fix vllm==0.3 deploy bug by @Jintao-Huang in https://github.com/modelscope/swift/pull/412
- Support deepseek math by @Jintao-Huang in https://github.com/modelscope/swift/pull/413
- update support_vllm by @Jintao-Huang in https://github.com/modelscope/swift/pull/415
- fix zero3 & swift lora by @Jintao-Huang in https://github.com/modelscope/swift/pull/416
- Support peft0.8.0 by @tastelikefeet in https://github.com/modelscope/swift/pull/423
- update readme by @Jintao-Huang in https://github.com/modelscope/swift/pull/426
- fix pai open with 'a' by @Jintao-Huang in https://github.com/modelscope/swift/pull/430
- default load_best_model_at_end=False by @Jintao-Huang in https://github.com/modelscope/swift/pull/432
- support openbuddy mixtral by @Jintao-Huang in https://github.com/modelscope/swift/pull/437
- support gemma by @Jintao-Huang in https://github.com/modelscope/swift/pull/441
- Support ms bench mini by @Jintao-Huang in https://github.com/modelscope/swift/pull/442
- Add roadmap and contributing doc by @tastelikefeet in https://github.com/modelscope/swift/pull/431
- support peft format by @tastelikefeet in https://github.com/modelscope/swift/pull/438
- update contributing.md by @Jintao-Huang in https://github.com/modelscope/swift/pull/446
- fix link by @tastelikefeet in https://github.com/modelscope/swift/pull/447
- Fix rlhf dataset by @tastelikefeet in https://github.com/modelscope/swift/pull/451
- Add task management for webui by @tastelikefeet in https://github.com/modelscope/swift/pull/457
- Support swift export by @Jintao-Huang in https://github.com/modelscope/swift/pull/455
- Fix llm quantization docs by @Jintao-Huang in https://github.com/modelscope/swift/pull/458
- fix get_vllm_engine bug by @Jintao-Huang in https://github.com/modelscope/swift/pull/463
- use cpu export by @Jintao-Huang in https://github.com/modelscope/swift/pull/462
- Fix llama2 generation config by @Jintao-Huang in https://github.com/modelscope/swift/pull/468
- Support editing model_id_or_path by @tastelikefeet in https://github.com/modelscope/swift/pull/469
- Support pt deploy by @Jintao-Huang in https://github.com/modelscope/swift/pull/467
- Fix swift deploy bug by @Jintao-Huang in https://github.com/modelscope/swift/pull/470
- fix deploy dep by @Jintao-Huang in https://github.com/modelscope/swift/pull/471
- Support LLaMAPRO and LoRA+ by @tastelikefeet in https://github.com/modelscope/swift/pull/472
- Support internlm2 1.8b by @Jintao-Huang in https://github.com/modelscope/swift/pull/473
- fix deepseek moe device_map by @Jintao-Huang in https://github.com/modelscope/swift/pull/476
- fix peft compatible bug by @tastelikefeet in https://github.com/modelscope/swift/pull/482
- Fix deepspeed init bug by @Jintao-Huang in https://github.com/modelscope/swift/pull/481
- fix scripts docs by @Jintao-Huang in https://github.com/modelscope/swift/pull/483
- Update swift export and update docs by @Jintao-Huang in https://github.com/modelscope/swift/pull/484
- support gptq export by @Jintao-Huang in https://github.com/modelscope/swift/pull/485
- fix docs & readme by @Jintao-Huang in https://github.com/modelscope/swift/pull/486
- fix app-ui bug by @Jintao-Huang in https://github.com/modelscope/swift/pull/488
- Support peft0.9 by @tastelikefeet in https://github.com/modelscope/swift/pull/490
- support torchrun_args for dpo cli and support web_ui model deployment by @slin000111 in https://github.com/modelscope/swift/pull/496
- Support transformers 4.33.0 by @tastelikefeet in https://github.com/modelscope/swift/pull/498
- Update deepspeed config by @Jintao-Huang in https://github.com/modelscope/swift/pull/500
- move docs to classroom by @tastelikefeet in https://github.com/modelscope/swift/pull/503
- Support yi 9b by @Jintao-Huang in https://github.com/modelscope/swift/pull/504
- Update yi sh by @Jintao-Huang in https://github.com/modelscope/swift/pull/506
Full Changelog: https://github.com/modelscope/swift/compare/v1.6.0...v1.7.0