v2.5.0

modelscope/ms-swift

版本发布时间: 2024-10-10 10:21:04

modelscope/ms-swift最新发布版本:v2.5.0(2024-10-10 10:21:04)

English Version

New Features:

Support for GPTQ & AWQ quantization of multimodal LLMs.
Support for dynamic addition of gradient checkpointing in the ViT section to reduce memory consumption.
Support for multimodal model pre-training.

New Models:

llama3.2, llama3.2-vision series
got-ocr2
llama3.1-omni
ovis1.6-gemma2
pixtral-12b
telechat2-115b
mistral-small-inst-2409

New Datasets:

egoschema

中文版

新特性：

支持多模态LLM的gptq&awq量化.
支持动态在vit部分增加gradient_checkpointing, 减少显存消耗.
支持多模态模型预训练.

新模型：

llama3.2, llama3.2-vision系列
got-ocr2
llama3.1-omni
ovis1.6-gemma2
pixtral-12b
telechat2-115b
mistral-small-inst-2409

新数据集：

egoschema

What's Changed

fix win32 quote by @tastelikefeet in https://github.com/modelscope/ms-swift/pull/2065
Fix yi template by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2067
fix rlhf zero3 by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2072
Update qwen2-vl最佳实践.md by @Digital2Slave in https://github.com/modelscope/ms-swift/pull/2058
fix RLHF & max_length by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2075
Support Mistral-small-inst-2409 by @DaozeZhang in https://github.com/modelscope/ms-swift/pull/2077
dynamic vit gradient_checkpointing by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2071
fix qwen2.5 template by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2081
fix multiprocess remove_columns by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2088
Support for fine-tuning Pixtral-12B. by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2090
fix vllm tokenizer by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2099
Fix the issue with media_offset in owl3 when batch_size > 1. by @LukeForeverYoung in https://github.com/modelscope/ms-swift/pull/2100
fix deploy openai compat by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2101
fix dataset preprocess by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2102
fix cpu infer device_map by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2103
fix infer device_map by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2105
Support for fine-tuning Llama 3.1 Omni. by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2106
support vllm & qwen2-vl video by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2110
Fix qwen2-vl zero2/3 by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2114
fix qwen2-audio by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2116
[TorchAcc] fix: fix find_labels and can_return_loss by @baoleai in https://github.com/modelscope/ms-swift/pull/2120
support got-ocr2 by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2123
Support for fine-tuning and deployment of the Llama 3.2 series models. by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2130
Support fine-tuning MLLama. by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2132
fix not impl bug by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2134
Compat vllm & qwen2-vl by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2136
fix requirements by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2137
fix model_type by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2138
fix deploy vllm by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2141
fix docs by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2142
Fix VLM lora by @tastelikefeet in https://github.com/modelscope/ms-swift/pull/2140
support mllm pt by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2146
[TorchAcc] fix: fix save config and additional file for swift and peft by @baoleai in https://github.com/modelscope/ms-swift/pull/2149
update quant_device_map by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2154
fix qwen2-audio by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2157
fix template by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2160
compat trl==0.11 by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2169
Support for Egoschema, a new video dataset by @DaozeZhang in https://github.com/modelscope/ms-swift/pull/2173
Update FAQ by @slin000111 in https://github.com/modelscope/ms-swift/pull/2165
fix mplug-owl3 infer by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2175
Support quant mllm by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2177
update setup.py by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2205
fix bugs by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2207
support telechat2 by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2210
Support ovis 1.6 by @Jintao-Huang in https://github.com/modelscope/ms-swift/pull/2211

New Contributors

@Digital2Slave made their first contribution in https://github.com/modelscope/ms-swift/pull/2058
@LukeForeverYoung made their first contribution in https://github.com/modelscope/ms-swift/pull/2100

Full Changelog: https://github.com/modelscope/ms-swift/compare/v2.4.2...v2.5.0

相关地址：原始地址下载(tar) 下载(zip)

查看：2024-10-10发行的版本