v.0.10.5
版本发布时间: 2021-12-31 22:15:33
espnet/espnet最新发布版本:v.202409(2024-10-01 14:28:01)
New Features
- [New Features][ESPnet1][ASR] Implement self-conditioned CTC #3856 by @komatta-san
- [New Features][ESPnet2][ASR][CI][Installation] GTN CTC for ESPnet2 #3778 by @brianyan918
- [New Features][ESPnet2][ASR][Refactoring] [ESPnet2] Transducer #2533 by @b-flo
- [New Features][ESPnet2][README][Recipe] Frontends fusion (any type, any number, linear fusion only for now) for ASR in espnet2 #3824 by @DanBerrebbi
- [New Features][ESPnet2][SE] Refactor loss computation in enhancement tasks. #3838 by @LiChenda
Recipe
- [Recipe][ESPnet1][ESPnet2][ASR][README] updated the results of aidatatang_200zh #3925 by @sw005320
- [Recipe][ESPnet1][VC] Various fixes of voice conversion recipes #3800 by @unilight
- [Recipe][ESPnet2][ASR][README] Expanding egs2 of Tedlium2 #3795 by @D-Keqi
- [Recipe][ESPnet2][ASR][README] Update an4 config #3913 by @pyf98
- [Recipe][ESPnet2][ASR][README] aidatatang_200zh recipe #3892 by @sw005320
- [Recipe][ESPnet2][README] Update README.md #3881 by @daisylab
- [Recipe][ESPnet2][README] Update egs2/TEMPLATE/README.md #3793 by @kamo-naoyuki
- [Recipe][ESPnet2][README] fix readme #3827 by @seastar105
- [Recipe][ESPnet2][README][Recipe] Add ASR Recipe: Primewords_Chinese #3903 by @pyf98
- [Recipe][ESPnet2][README][Recipe] Update MISP challenge ASR baseline and add AVSR baseline #3819 by @neillu23
- [Recipe][ESPnet2][README][SLU] Fsc Maseeval scripts #3769 by @siddhu001
- [Recipe][ESPnet2][README][SLU] Update Google Speechcommands (SLU recipe) #3915 by @pyf98
- [Recipe][ESPnet2][README][TTS] ESPnet2 ARCTIC TTS #3791 by @peter-yh-wu
- [Recipe][ESPnet2][README][TTS] Update README and add missing config #3917 by @kan-bayashi
- [Recipe][ESPnet2][Recipe][SLU] Slue voxceleb Sentiment Analysis #3894 by @siddhu001
- [Recipe][ESPnet2][SE] modified data type in enh.sh #3768 by @simpleoier
Bugfix
- [Bugfix][ESPnet1][README][RNNT] Fix cache for Transducer search strategies + doc #3869 by @b-flo
- [Bugfix][ESPnet1][RNNT] Fix recombine_hyps #3908 by @b-flo
- [Bugfix][ESPnet1][RNNT] fix rnn-t ALSD beam search index bug #3794 by @maxwellzh
- [Bugfix][ESPnet1][RNNT] fix the sort order in select_k_expansions() #3864 by @freewym
- [Bugfix][ESPnet2] Bug fix for .gitignore and db fill up for CMU cluster #3891 by @siddalmia
- [Bugfix][ESPnet2] Fix #3716 #3849 by @kan-bayashi
- [Bugfix][ESPnet2] Merging asr_streaming.sh into asr.sh for laborotv egs2 #3868 by @D-Keqi
- [Bugfix][ESPnet2] add init.py #3928 by @sw005320
- [Bugfix][ESPnet2] fix small problem that used before defined in step 12 #3871 by @simpleoier
- [Bugfix][ESPnet2] fix stft olens when win_lengths is not equal to n_fft #3812 by @IceCreamWW
- [Bugfix][ESPnet2] update s3prl frontend w.r.t. recent modification in s3prl interface #3839 by @simpleoier
- [Bugfix][ESPnet2][TTS] bugfix lang2lid in tts.sh #3906 by @imdanboy
- [Bugfix][Installation] Fix #3783 #3786 by @kamo-naoyuki
Others
- [CI] Fix G2P test failure in CI due to the dict update #3848 by @kan-bayashi
- [CI][Documentation][ESPnet1][ESPnet2] Fixing issues about streaming Transformer/Conformer training #3880 by @D-Keqi
- [CI][ESPnet1][ESPnet2][Installation][New Features][README] nbest rescoring with k2 #3567 by @glynpu
- [Documentation][README] Update README.md #3893 by @sw005320
- [Documentation][README][SSL] Add more docs about s3prl frontend #3796 by @simpleoier
- [Documentation][README][streaming] Updating main README.md about streaming transformer #3855 by @D-Keqi
- [ESPnet1][RNNT] Add exception for conformer decoder #3801 by @b-flo
- [ESPnet2][README][Typo] Fix typo in README.md #3852 by @kan-bayashi
- [ESPnet2][SE] add eps in beam-forming reference channel selection #3904 by @LiChenda
- [ESPnet2][SLU] Add unit test for score_intent.py #3759 by @siddhu001
- [ESPnet2][ST] Speech Translation Update #3860 by @ftshijt
- [ESPnet2][TTS][Installation][Refactoring] Refactor Phonemizer-based G2P #3916 by @kan-bayashi
Acknowledgements
Special thanks to @D-Keqi, @DanBerrebbi, @IceCreamWW, @LiChenda, @b-flo, @brianyan918, @daisylab, @freewym, @ftshijt, @glynpu, @imdanboy, @kamo-naoyuki, @kan-bayashi, @komatta-san, @maxwellzh, @neillu23, @peter-yh-wu, @pyf98, @seastar105, @siddalmia, @siddhu001, @simpleoier, @sw005320, @unilight.