v1.4.0
Release date: 2021-10-02 08:49:13
Latest NVIDIA/NeMo release: r2.0.0rc1 (2024-08-16 05:55:14)
Features
- Improved speaker clustering #2729
- Upgrade to NVIDIA PyTorch 21.08 container #2799
- RNNT mAES beam search support #2802
- Transfer learning for new speakers #2684
- Simplify speaker scripts #2777
- Perceiver-encoder architecture #2737
- Relative paths in tarred datasets #2776
- Torch-only TTS package #2643
- Inverse text normalization for Spanish #2489
Tutorial Notebooks
- Duration and pitch control for TTS #2700
Bug fixes
- Fixed max delta generation #2727
- WaveGlow export #2671, #2699
Contributors
@tango4j @titu1994 @paarthneekhara @nithinraok @michalivne @erastorgueva-nv @borisfom @blisc (some contributors may not be listed explicitly)