MyGit

v0.6.0

coqui-ai/TTS

版本发布时间: 2022-03-07 22:44:06

coqui-ai/TTS最新发布版本:v0.22.0(2023-12-12 23:11:16)

What's Changed

Tokenizer API

Tokenizer API is defined by the TTSTokenizer class. It is intended to provide all the text processing functionalities to a tts model. New tokenizers can also be added by subclassing the TTSTokenizer class.

Phonemizer API

Phonemizer API is defined by the BasePhonemizer class and implemented by the ESpeak and Gruut wrappers, ZH_CH, JP_JA phonemizers. New phonemizers can be added by implementing the BasePhonemizer class.

BaseCharacters

BaseCharacters class provides an API to define the model vocabulary and provide the dictionary to map characters to token IDs and back. There are two pre-defined classes inheriting from BaseCharacters. IPAPhonemes and Graphemes that respectively define the IPA phoneme character set for models using phonemes and grapheme set for models using raw characters.

Punctuations class

Punctuations class to strip out punctuations and restore them when needed.

Language specific text normalization routines under TTS.tts.utils.text

Under TTS.tts.utils.text there are folders for each language to accommodate the text normalization routines that are designed for the language.

👟Trainer

We separate the trainer as a new repo 👟Trainer. It is a general-purpose model trainer for Pytorch with certain design choices in mind.

It is a very early-stage and monolithic library currently. Feel free to share your ✨feedback✨ and ✨contribute✨.

VITS implementation update

With this version of VITS model, we get rid of some of the issues that affect the model performance. It also illustrates well how you could adapt any open-source model implementation to 🐸TTS and 👟Trainer without even knowing the rest for 🐸TTS library.

Full Changelog: https://github.com/coqui-ai/TTS/compare/v0.5.0...v0.6.0

New Models

相关地址:原始地址 下载(tar) 下载(zip)

查看:2022-03-07发行的版本