v.0.9.9
版本发布时间: 2021-04-07 12:51:48
espnet/espnet最新发布版本:v.202409(2024-10-01 14:28:01)
New Features
- [New Features][ESPnet2] Speaker diarization implementation in ESPnet #2939 by @ftshijt
- [New Features][ESPnet2] Adding gpu_max_cached_mem_GB in reporter's stats #3057 by @kamo-naoyuki
- [New Features][ESPnet2] add --detect_anomaly option #3035 by @kamo-naoyuki
- [New Features][ESPnet2][SE] Further update to speech enhancement task #2929 by @shincling
Bugfix
- [Bugfix][ESPnet1] Fix a typo in the aishell config #3089 by @sw005320
- [Bugfix][ESPnet1] Fix utils/speed_perturb.sh #3062 by @hirofumi0810
- [Bugfix][ESPnet1] fix #3017 #3022 by @kamo-naoyuki
- [Bugfix][ESPnet1][RNNT] Fix+update RNN encoder #3048 by @b-flo
- [Bugfix][ESPnet1][RNNT] Minor fix for NSC #3030 by @b-flo
- [Bugfix][ESPnet2] Fix #3072 #3073 by @kamo-naoyuki
- [Bugfix][ESPnet2] Fix ESPnet2-TTS conformer backward compatibility #3108 by @kan-bayashi
- [Bugfix][ESPnet2] Fix a bug when use_amp=True without fairscale #3029 by @kamo-naoyuki
- [Bugfix][ESPnet2] Fix logging for pytorch>=1.8 #3056 by @kamo-naoyuki
- [Bugfix][ESPnet2] Fixed backward compatibility issue of new conformer definition #3068 by @hfujihara
- [Bugfix][Installation] Fix a bug of uninstalling typing #3058 by @kamo-naoyuki
- [Bugfix][Installation] Fix setup.py to install filelock #3074 by @kamo-naoyuki
- [Bugfix][Installation] fix the condition to install fairscale #3050 by @kamo-naoyuki
- [Bugfix][Recipe][ESPnet1] Typo fixed for nahuatl recipe #3044 by @ftshijt
- [Bugfix][Recipe][ESPnet1][ASR] Bugfix for download_and_untar for nahuatl #3049 by @ftshijt
- [Bugfix][Recipe][ESPnet1][ESPnet2][TTS] Fix CSMSC download script #3109 by @kan-bayashi
- [Bugfix][Recipe][ESPnet2][TTS][README] fixed typo #3121 #3123 by @kan-bayashi
Enhancement
- [Enhancement][ASR][ESPnet1][RNNT] Update loss report #3110 by @b-flo
- [Enhancement][ESPnet1][RNNT] Fix related to custom encoder and aux task #3045 by @b-flo
- [Enhancement][ESPnet2][Documentation][Installation][README] modification of freezing option for Wav2Vec encoder, add documents #3036 by @simpleoier
Recipe
- [Recipe][ESPnet1][ASR] added results and uploaded models #3063 by @sw005320
- [Recipe][ESPnet1][ASR][ST] fix download for puebla-nahuatl #3039 by @ftshijt
- [Recipe][ESPnet1][MT] Update IWSLT18 MT recipe #3071 by @hirofumi0810
- [Recipe][ESPnet1][ST] IWSLT21-low-resource recipe #3023 by @ftshijt
- [Recipe][ESPnet1][ST] Nahuatl Speech Translation #3034 by @ftshijt
- [Recipe][ESPnet2][ASR][README] Added spgispeech recipe in espnet2 #2986 by @sw005320
- [Recipe][ESPnet2][ASR][README] Update librispeech result #3082 by @kamo-naoyuki
- [Recipe][ESPnet2][ASR][README] Updated ami ihm result #3091 by @kamo-naoyuki
- [Recipe][ESPnet2][ASR][README] added a bpe10000 model and result #3060 by @sw005320
- [Recipe][ESPnet2][ASR][README] gigaspeech #3077 by @sw005320
Refactoring
- [Refactoring][ESPnet1] Refactor layer selection in Transformer #3024 by @hirofumi0810
- [Refactoring][ESPnet1][MT][ST] Unify divide_lang.sh #3066 by @hirofumi0810
- [Refactoring][ESPnet2] Make batch bins sampler faster #3106 by @kamo-naoyuki
- [Refactoring][Installation] Use new pyopenjtalk version #3107 by @kan-bayashi
- [Refactoring][ESPnet1][ESPnet2][Installation][Docker][Documentation] Change '#!/bin/bash' to '#!/usr/bin/env bash' #3059 by @kamo-naoyuki
Other
- [CI][Installation][README][mergify] Using torch=1.8.1 in ci tests #3122 by @kamo-naoyuki
- [CI][Installation][README][mergify] Adding pytorch=1.8.0 to the ci #3046 by @kamo-naoyuki
Acknowledgements
Special thanks to @b-flo, @ftshijt, @hfujihara, @hirofumi0810, @kamo-naoyuki, @kan-bayashi, @shincling, @simpleoier, @sw005320.