v1.5.1
版本发布时间: 2021-12-04 08:00:07
NVIDIA/NeMo最新发布版本:r2.0.0rc1(2024-08-16 05:55:14)
Features
- Minor updates to expose speaker id, pitch, and duration on export of FastPitch #3192, #3207
Known Issues
- Training of speaker models converge very slowly due to a bug (fixed in main: #3354)
- ASR training does not reach adequate WER due to bug in Numba Spec Augment (fixed in main : #3299). For details refer to https://github.com/NVIDIA/NeMo/issues/3288#issuecomment-1000766337 . For a temporary workaround, disable Numba Spec Augment with https://github.com/NVIDIA/NeMo/blob/main/nemo/collections/asr/modules/audio_preprocessing.py#L471 set to False in the config for SpecAugment in the yaml config. The fix will be part of 1.6.0.