v.0.10.3
版本发布时间: 2021-10-11 20:37:54
espnet/espnet最新发布版本:v.202409(2024-10-01 14:28:01)
New Features
- [New Features][ESPnet1][RNNT][Installation][README] FastEmit support #3591 by @b-flo
- [New Features][ESPnet2][ASR] Add ASR portable evaluation script #3569 by @kan-bayashi
- [New Features][ESPnet2][README] EEND-EDA model for diarization task #3621 by @YushiUeda
Bugfix
- [Bugfix][ESPnet1] Fix /usr/bin/env bash -e #3651 by @kamo-naoyuki
- [Bugfix][ESPnet1] ctc loss using dropout layer since .eval() will not work for F.dropout #3539 by @zh794390558
- [Bugfix][ESPnet2] Minor fix of
evaluate_asr.sh
#3596 by @kan-bayashi - [Bugfix][ESPnet2][ASR] wav2vec2_encoder bug fix #3545 by @simpleoier
- [Bugfix][ESPnet2][README][SSL] Fix some issues of #3512 and add README.md to librispeech/ssl1 recipe. #3572 by @Jzmo
- [Bugfix][ESPnet2][TTS] Bug fix the attribute registration in VITS generator #3573 by @kan-bayashi
- [Bugfix][ESPnet2][TTS] Fix pyopenjtalk_g2p_accent(_with_pause) #3555 by @zzxiang
Recipe
- [Recipe][ESPnet1][ASR][RNNT] Update Transducer recipes #3465 by @b-flo
- [Recipe][ESPnet1][ST] Clean libri-trans #3540 by @hirofumi0810
- [Recipe][ESPnet2][ASR][README] Dan aishell4 branch #3585 by @DanBerrebbi
- [Recipe][ESPnet2][ASR][README] update pretrained models of librispeech using hubert/wav2vec2 #3568 by @simpleoier
- [Recipe][ESPnet2][SLU][README] Add slu snips data receipe #3407 by @yuekaizhang
- [Recipe][ESPnet2][TTS] Update GAN-TTS based configurations #3570 by @kan-bayashi
- [Recipe][ESPnet2][TTS][README] Add initial VITS results for JSUT #3550 by @kan-bayashi
- [Recipe][ESPnet2][TTS][README] Add つくよみちゃんコーパス recipe #3552 by @kan-bayashi
- [Recipe][ESPnet2][TTS][README] IndicSpeech TTS Scripts #3435 by @peter-yh-wu
- [Recipe][ESPnet2][TTS][README] Update ESPnet2-TTS results #3578 by @kan-bayashi
- [Recipe][ESPnet2][TTS][README] Update JSUT and JVS results #3553 by @kan-bayashi
- [Recipe][ESPnet2][TTS][README] Update LJSpeech and CSMSC results #3560 by @kan-bayashi
- [Recipe][ESPnet2][TTS][README] Update TTS results #3615 by @kan-bayashi
- [Recipe][ESPnet2][TTS][README] Update TTS results #3648 by @kan-bayashi
- [Recipe][ESPnet2][TTS][README] Update VCTK results #3581 by @kan-bayashi
- [Recipe][ESPnet2][TTS][README] Update pret-trained model for TTS recipes #3590 by @ftshijt
- [Recipe][ESPnet2][TTS][README] update kss recipe with new result. #3589 by @windtoker
- [Recipe][ESPnet2][TTS][Typo] Fix typo
egs2/jtubespeech/tts1
#3564 by @kan-bayashi - [Recipe][ESPnet2][TTS][Typo] Update JVS README #3554 by @kan-bayashi
Enhancement
- [Enhancement][ESPnet2][SE][Refactoring] Add PyTorch Builtin Complex Support in the Speech Enhancement Task #3355 by @Emrys365
- [Enhancement][ESPnet2][TTS] Hindi g2p #3579 by @peter-yh-wu
- [Enhancement][ESPnet2][TTS] Unify spks / lids / spk_embed_dim type #3551 by @kan-bayashi
- [Enhancement][ESPnet2][TTS] Update
evaluate_mcd.py
script #3566 by @kan-bayashi - [Enhancement][ESPnet2][TTS][Installation] Add the installer of tdmelodic pyopenjtalk #3561 by @kan-bayashi
- [Enhancement][ESPnet2][TTS][Installation][README] Update TTS objective eval scripts #3650 by @kan-bayashi
- [Enhancement][ESPnet2][TTS][README] Add a new Japanese G2P for TTS #3558 by @kan-bayashi
- [Enhancement][ESPnet2][TTS][README] Add a new english G2P #3597 by @kan-bayashi
Others
- [CI] Add codecov config and flags. #3603 by @ShigekiKarita
- [CI] Omit tools/ from code coverage. #3600 by @ShigekiKarita
- [CI] Split test_integration.sh #3599 by @ShigekiKarita
- [CI][ESPnet2][Installation][Refactoring] Make the installation of transformers optional #3622 by @kan-bayashi
- [CI][Installation] Add no-check-certificate option in PESQ installation #3649 by @kan-bayashi
- [CI][Installation][README][mergify] Change setup.py for pytorch1.9.1 #3636 by @kamo-naoyuki
- [Documentation][ESPnet1][RNNT] Fix/improve doc(string)s related to Transducer model #3623 by @b-flo
- [Documentation][ESPnet2][TTS][README] Update README of ESPnet2-TTS #3546 by @kan-bayashi
- [Documentation][ESPnet2][TTS][README] Update TTS README #3565 by @kan-bayashi
- [Documentation][ESPnet2][TTS][README] Update TTS fine-tuning README #3549 by @kan-bayashi
- [Typo][ESPnet2] Minor bug in format_wav_scp.py #3575 by @ftshijt
- [Typo][ESPnet2][TTS] update mismatch help info for tts #3602 by @ftshijt
Acknowledgements
Special thanks to @DanBerrebbi, @Emrys365, @Jzmo, @ShigekiKarita, @YushiUeda, @b-flo, @ftshijt, @hirofumi0810, @kamo-naoyuki, @kan-bayashi, @peter-yh-wu, @simpleoier, @windtoker, @yuekaizhang, @zh794390558, @zzxiang.