v0.1.6
Release time: 2023-01-16 19:28:23
Latest release of modelscope/FunASR: v0.3.0 (2023-03-16 16:15:02)
Release Notes:
2023.1.16, funasr-0.1.6
- We release a new model, Paraformer-large-long, which integrates VAD, ASR, punctuation, and timestamp prediction. The model can take inputs several hours long.
- We release a new model type, VAD, which predicts the duration of non-silent speech. It can be freely combined with any ASR model in the Model Zoo.
- We release a new model type, Punctuation, which predicts punctuation for ASR output. It can be freely combined with any ASR model in the Model Zoo.
- We release a new model, Data2vec, an unsupervised pretraining model that can be fine-tuned for ASR and other downstream tasks.
- We release a new model, Paraformer-Tiny, a lightweight Paraformer model that supports Mandarin command-word recognition.
- We release a new model type, SV, which extracts speaker embeddings and performs speaker verification on paired utterances. Speaker diarization support is planned for a future version.
- We speed up inference in the modelscope pipeline by integrating the model-building step into pipeline construction.
- The modelscope inference pipeline now supports several new audio input types, including wav.scp, wav format, audio bytes, and wave samples, among others.
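To make the input types in the last item concrete, the following is a minimal sketch of how the same audio can be represented as WAV bytes, as raw wave samples, and as an entry in a wav.scp list. It uses only the Python standard library and does not call modelscope or FunASR. The helper names (`make_wav_bytes`, `wav_bytes_to_samples`, `parse_wav_scp`) are hypothetical, and the wav.scp layout is assumed to follow the Kaldi convention of one `<utt_id> <path>` pair per line.

```python
import io
import struct
import wave


def make_wav_bytes(samples, sample_rate=16000):
    """Encode 16-bit mono PCM samples as an in-memory WAV file (audio bytes)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        w.setframerate(sample_rate)
        w.writeframes(struct.pack(f"<{len(samples)}h", *samples))
    return buf.getvalue()


def wav_bytes_to_samples(data):
    """Decode WAV bytes into (sample_rate, list of int16 wave samples)."""
    with wave.open(io.BytesIO(data), "rb") as r:
        if r.getnchannels() != 1 or r.getsampwidth() != 2:
            raise ValueError("this sketch handles 16-bit mono PCM only")
        n = r.getnframes()
        frames = r.readframes(n)
        return r.getframerate(), list(struct.unpack(f"<{n}h", frames))


def parse_wav_scp(text):
    """Parse wav.scp text (assumed Kaldi-style: '<utt_id> <path>' per line)."""
    table = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        utt_id, path = line.split(maxsplit=1)
        table[utt_id] = path
    return table


# Usage: round-trip a few samples and read a two-entry wav.scp listing.
wav = make_wav_bytes([0, 1000, -1000, 32767])
rate, samples = wav_bytes_to_samples(wav)
scp = parse_wav_scp("utt1 /data/a.wav\nutt2 /data/b.wav\n")
```

Which of these representations a given pipeline call accepts, and under which argument name, depends on the installed modelscope version and is not shown here.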
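Latest updates
- January 2023 (released January 16): funasr-0.1.6, modelscope-1.2.0
  - New models:
    - Paraformer-large long-audio model: integrates VAD, ASR, punctuation, and timestamps; it can directly recognize audio several hours long and outputs punctuated text with timestamps.
    - Chinese unsupervised pretraining Data2vec model: uses the Data2vec architecture and is pretrained without supervision on AISHELL-2 data; it can be fine-tuned for ASR or other downstream tasks.
    - 16k voice activity detection (VAD) model: detects the start and end times of valid speech in long audio segments.
    - General-purpose Chinese punctuation prediction model: predicts punctuation for the text output by speech recognition models.
    - 8K UniASR streaming model and 8K UniASR model: UniASR unifies streaming and offline speech recognition; while performing streaming recognition, it can output offline recognition results with low latency to correct the predicted text.
    - Paraformer-large models fine-tuned on AISHELL-1 and on AISHELL-2: the Paraformer-large model fine-tuned separately on AISHELL-1 and AISHELL-2 data.
    - Speaker verification model: can be used for speaker verification and for speaker embedding extraction.
    - Small on-device Paraformer command-word model: the command-word version of Paraformer-tiny, which uses a model with few parameters to support command-word recognition.
  - Upgraded the original TensorFlow models to PyTorch models for inference, with support for fine-tuning and customization, including:
    - 16K models: Paraformer Chinese, Paraformer-large Chinese, UniASR Chinese, UniASR-large Chinese, UniASR Chinese streaming, UniASR dialect, UniASR dialect streaming, UniASR Japanese, UniASR Japanese streaming, UniASR Indonesian, UniASR Indonesian streaming, UniASR Portuguese, UniASR Portuguese streaming, UniASR English, UniASR English streaming, UniASR Russian, UniASR Russian streaming, UniASR Korean, UniASR Korean streaming, UniASR Spanish, UniASR Spanish streaming, UniASR Cantonese (Simplified), UniASR Cantonese (Simplified) streaming
    - 8K models: Paraformer Chinese, UniASR Chinese, UniASR Chinese streaming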
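The VAD item above describes detecting the start and end times of valid speech in long audio. As a rough illustration of what such output can look like, here is a toy detector based on a simple frame-energy threshold. This is a swapped-in technique for illustration only, not the released VAD model; the function name `energy_vad`, its parameters, and the `[start_ms, end_ms]` output layout are all assumptions.

```python
import math


def energy_vad(samples, sample_rate=16000, frame_ms=10, threshold=500.0):
    """Return [start_ms, end_ms] spans whose frame RMS exceeds the threshold.

    Toy energy-based detector (an assumption for illustration), operating on
    a list of integer PCM samples split into non-overlapping frames.
    """
    frame_len = sample_rate * frame_ms // 1000
    n_frames = len(samples) // frame_len
    segments = []
    start = None  # start time (ms) of the speech span currently open, if any
    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        rms = math.sqrt(sum(s * s for s in frame) / frame_len)
        if rms > threshold and start is None:
            start = i * frame_ms  # speech begins at this frame
        elif rms <= threshold and start is not None:
            segments.append([start, i * frame_ms])  # speech ended before this frame
            start = None
    if start is not None:
        # Audio ended while speech was still active.
        segments.append([start, n_frames * frame_ms])
    return segments


# Usage: 100 ms of silence, 100 ms of a loud square wave, 100 ms of silence.
audio = [0] * 1600 + [2000, -2000] * 800 + [0] * 1600
speech_spans = energy_vad(audio)
```

Spans of this kind are what allow a long recording to be cut into shorter pieces before recognition, with each piece's timestamps shifted by its span start.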
New Contributors
- @nichongjia-2007 made their first contribution in https://github.com/alibaba-damo-academy/FunASR/pull/27
Full Changelog: https://github.com/alibaba-damo-academy/FunASR/compare/v0.1.4...v0.1.6