v1.5.0

modelscope/modelscope

Release date: 2023-04-18 18:40:10

Latest release of modelscope/modelscope: v1.18.1 (2024-09-21 22:08:11)

Chinese Version

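Recommended New Models

No.	Model Name & Quick Link
1	ResNet50 pedestrian structured attribute recognition model
2	DamoFD face detection and keypoint model-0.5G
3	CAM++ speaker verification-English-VoxCeleb-16k
4	Machine translation with self-evaluation capability-Chinese-to-English-general domain-large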

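Highlights

Support efficient LoRA fine-tuning of generative diffusion models
Add llama model
Support the ability to push to hub
Support the chat task for chatglm-6B style models
Add CLI usage examples for common models and tasks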

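Feature List

Support splitting and merging checkpoints saved by megatron tensor-parallel models
Support efficient LoRA fine-tuning of generative diffusion models
Add pedestrian attribute recognition model
Add damofd model series
Add llama model
Support the ability to push to hub
Add speaker cam++ model
Add head support for the XlmRoberta model
Add canmt translation model
Support the chat task for chatglm-6B style models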

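Improvements

Update funasr to version 0.4.0, with support for running on Mac
Plugin supports trainer
Add a 3.7B model to fid_dialouge_pipeline
Add a training example for the token classification task with the Mgeo model
Add a training example for the text generation task with the PALM model
Add a training example for the multi-modal embedding task with the CLIP model
Add gradient accumulation config for speech kws nearfield training
Refactor and optimize the face reconstruction model code
Update image colorization metrics
Update GitHub issue templates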

BugFix

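Fix an error in generate for text generation models
Fix an error in the face reconstruction model pipeline
Fix repeated warning output in pipeline
Fix an error raised when a plugin fails to import a package
Fix an error in multi-GPU training for speech kws nearfield
Fix missing spaces in English output of generative models
Fix jsonplus not supporting ndarray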

English Version

New Model List and Quick Access

No	Model Name & Link
1	ResNet50 pedestrian-attribute-recognition image
2	DamoFD face-detection 0.5G
3	Speech cam++ English-VoxCeleb-16k
4	Canmt translation with self evaluation zh2en-large

Highlight

Add efficient tuner modules, supporting LoRA fine-tuning of generative diffusion models
Add llama to mslib from hf
Support the ability to push to hub
Add task chat for all chat models, like chatglm-6B
Add CLI usage examples for common models and tasks

Breaking changes

Feature

Support splitting and merging checkpoints of megatron_base (tensor-parallel) models
Add efficient tuner modules, supporting LoRA fine-tuning of generative diffusion models
Add pedestrian attribute recognition model
Add damofd model
Add llama to mslib from hf
Support the ability to push to hub
Add speaker model cam++ for speaker verification task
New head support for XlmRoberta model
Add canmt translation model
Add task chat for all chat models, like chatglm-6B
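
The megatron checkpoint split and merge item above can be pictured with a small sketch. This is not the modelscope implementation; it only assumes that a tensor-parallel layer stores equal column shards of one weight matrix, one shard per rank. The names split_columns and merge_columns are hypothetical.

```python
from typing import List

Matrix = List[List[float]]


def split_columns(weight: Matrix, parts: int) -> List[Matrix]:
    """Split a 2-D weight into `parts` equal column shards, one per rank."""
    cols = len(weight[0])
    if cols % parts != 0:
        raise ValueError("column count must be divisible by the number of shards")
    width = cols // parts
    return [
        [row[rank * width:(rank + 1) * width] for row in weight]
        for rank in range(parts)
    ]


def merge_columns(shards: List[Matrix]) -> Matrix:
    """Concatenate column shards back into the full weight, in rank order."""
    rows = len(shards[0])
    return [[value for shard in shards for value in shard[i]] for i in range(rows)]
```

Merging the shards in rank order restores the original matrix, which is the property a split or merge tool needs in order to change the tensor-parallel degree of a saved checkpoint.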

Improvements

Update funasr to 0.4.0, with support for running on Mac
Plugin supports trainer
Add 3.7B size model for fid_dialouge_pipeline
Add token classification example for MGeo
Add PALM finetune example
Add multi-modal embedding example for CLIP
Add gradient accumulation config for speech kws nearfield training
Update face reconstruction to HRN(CVPR2023)
Update image colorization metric
Update issue templates
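
As background for the gradient accumulation item above, here is a minimal sketch of the general technique, not the speech kws nearfield trainer itself. It assumes a single scalar weight and a mean-squared-error loss; train_with_accumulation, mse_grad and accum_steps are hypothetical names chosen for illustration.

```python
from typing import Callable, List, Sequence, Tuple

Batch = Sequence[Tuple[float, float]]  # (input, target) pairs


def mse_grad(w: float, batch: Batch) -> float:
    """Gradient of mean((w * x - y) ** 2) with respect to w."""
    return sum(2.0 * x * (w * x - y) for x, y in batch) / len(batch)


def train_with_accumulation(
    w: float,
    batches: List[Batch],
    grad_fn: Callable[[float, Batch], float],
    lr: float,
    accum_steps: int,
) -> float:
    """Apply one SGD update per `accum_steps` micro-batches."""
    accumulated = 0.0
    pending = 0
    for batch in batches:
        # Scale each micro-batch gradient so the sum is an average.
        accumulated += grad_fn(w, batch) / accum_steps
        pending += 1
        if pending == accum_steps:
            w -= lr * accumulated
            accumulated = 0.0
            pending = 0
    # Micro-batches that do not fill a whole window are dropped in this sketch.
    return w
```

With equal-sized micro-batches, accumulating over N of them gives the same update as one batch N times larger, which is why the option helps when memory limits the batch size.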

BugFix

Fix generate for ModelForTextGeneration
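Fix errors in the face reconstruction pipeline
Fix warnings being printed repeatedly in pipeline
Fix error raised when a plugin fails to import a package
Fix speech kws nearfield training with multi-gpu
Fix missing spaces between English words in generative model output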
Fix jsonplus, support ndarray
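
The ndarray item above concerns a common serialization problem: the standard json module cannot encode array objects. The sketch below shows one generic way to handle it and is not the jsonplus code. It assumes the array type exposes a tolist() method, as numpy.ndarray does; ArrayEncoder and dumps_with_arrays are hypothetical names.

```python
import json
from typing import Any


class ArrayEncoder(json.JSONEncoder):
    """JSON encoder that converts array-like objects through tolist()."""

    def default(self, o: Any) -> Any:
        tolist = getattr(o, "tolist", None)
        if callable(tolist):
            # Nested Python lists are natively serializable.
            return tolist()
        # Fall back to the base class, which raises TypeError.
        return super().default(o)


def dumps_with_arrays(obj: Any) -> str:
    """Serialize obj to JSON, accepting array-like values anywhere inside it."""
    return json.dumps(obj, cls=ArrayEncoder)
```

With numpy installed, dumps_with_arrays({"scores": numpy.arange(3)}) would serialize the array as a JSON list instead of raising TypeError.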
