v.202301
Release date: 2023-02-01 18:50:13
Latest espnet/espnet release: v.202409 (2024-10-01 14:28:01)
What's Changed
- Initialize VISinger branch by @ftshijt in https://github.com/espnet/espnet/pull/4683
- Update VISinger branch by @ftshijt in https://github.com/espnet/espnet/pull/4705
- Update UASR branch with latest ESPnet functions by @ftshijt in https://github.com/espnet/espnet/pull/4752
- Update uasr by @ftshijt in https://github.com/espnet/espnet/pull/4770
- Shell scripts for UASR processing by @ftshijt in https://github.com/espnet/espnet/pull/4769
- Uasr python scripts by @DongjiGao in https://github.com/espnet/espnet/pull/4791
- Update visinger by @ftshijt in https://github.com/espnet/espnet/pull/4818
- Update test_custom_transducer.py by @sw005320 in https://github.com/espnet/espnet/pull/4826
- Update asr.sh by @sw005320 in https://github.com/espnet/espnet/pull/4827
- Fixed pad mode for librosa.stft by @Masao-Someki in https://github.com/espnet/espnet/pull/4832
- Add E-Branchformer models in some recipes by @pyf98 in https://github.com/espnet/espnet/pull/4833
- Fix data prep in GigaSpeech by @pyf98 in https://github.com/espnet/espnet/pull/4836
- time sync decoding for asr by @brianyan918 in https://github.com/espnet/espnet/pull/4792
- Remove duplicated VOXFORGE in db.sh (line81 and line157) by @pyf98 in https://github.com/espnet/espnet/pull/4840
- Fix argument parsing for non_linguistic_symbols in asr.sh by @pyf98 in https://github.com/espnet/espnet/pull/4841
- Add a warning statement when the hypo length equals the max out length by @pengchengguo in https://github.com/espnet/espnet/pull/4843
- Add target speaker extraction (TSE) functions by @Emrys365 in https://github.com/espnet/espnet/pull/4823
- Multilingual superb by @ftshijt in https://github.com/espnet/espnet/pull/4824
- VISinger by @jerryuhoo in https://github.com/espnet/espnet/pull/4689
- Update VISinger to latest by @ftshijt in https://github.com/espnet/espnet/pull/4849
- VISinger for singing voice synthesis by @ftshijt in https://github.com/espnet/espnet/pull/4848
- Reduce word counts for ESPnet-SE++ Joss paper by @neillu23 in https://github.com/espnet/espnet/pull/4844
- Add E-Branchformer configs and models in ASR recipes by @pyf98 in https://github.com/espnet/espnet/pull/4837
- Address Muskits updates on README by @ftshijt in https://github.com/espnet/espnet/pull/4850
- Minor fix for MSUPERB recipe by @ftshijt in https://github.com/espnet/espnet/pull/4851
- Update for the latest changes in the draft (minor changes) by @neillu23 in https://github.com/espnet/espnet/pull/4852
- Add E-Branchformer results on Librispeech by @kkim-asapp in https://github.com/espnet/espnet/pull/4856
- Update HuBERT implementation by @simpleoier in https://github.com/espnet/espnet/pull/4747
- VISinger unit test by @jerryuhoo in https://github.com/espnet/espnet/pull/4855
- Minor fix to commonvoice espnet1 by @ftshijt in https://github.com/espnet/espnet/pull/4862
- [WIP] Add S4 decoder in ESPnet2 by @m-koichi in https://github.com/espnet/espnet/pull/4845
- Update HuBERT feature and acknowledgement information in related READMEs by @simpleoier in https://github.com/espnet/espnet/pull/4863
- Generating MFA alignments by @Fhrozen in https://github.com/espnet/espnet/pull/4803
- [WIP] EURO uasr scripts by @DongjiGao in https://github.com/espnet/espnet/pull/4846
- Update README.md related to ASR architecture by @m-koichi in https://github.com/espnet/espnet/pull/4865
- Minor fix to librimix diar recipe by @ftshijt in https://github.com/espnet/espnet/pull/4867
- Add Full Whisper Model for Finetuning by @slSeanWU in https://github.com/espnet/espnet/pull/4793
- Add torchaudio version check for HuBERT pretraining by @simpleoier in https://github.com/espnet/espnet/pull/4872
- add k2 decoder related scripts for EURO by @DongjiGao in https://github.com/espnet/espnet/pull/4868
- EURO: small fix (temporarily remove support for nbest_rescoring) by @DongjiGao in https://github.com/espnet/espnet/pull/4875
- Add description for Whisper ASR in homepage readme by @slSeanWU in https://github.com/espnet/espnet/pull/4877
- Update README.md by @eltociear in https://github.com/espnet/espnet/pull/4879
- add explanations to text tokenizing related scripts and remove unused script by @DongjiGao in https://github.com/espnet/espnet/pull/4880
- update information about source and our modification for k2 related scripts by @DongjiGao in https://github.com/espnet/espnet/pull/4881
- AphasiaBank ASR recipe by @tjysdsg in https://github.com/espnet/espnet/pull/4860
- Multilingual SUPERB update by @ftshijt in https://github.com/espnet/espnet/pull/4878
- ESPnet Unsupervised ASR (EURO project) by @ftshijt in https://github.com/espnet/espnet/pull/4774
- Support ProDiff in TTS by @Fhrozen in https://github.com/espnet/espnet/pull/4808
- Add E-Branchformer for GigaSpeech by @pyf98 in https://github.com/espnet/espnet/pull/4882
- FLEURS - Auxiliary CTC conditioning tasks by @wanchichen in https://github.com/espnet/espnet/pull/4756
- Add python 3.8 requirement for Whisper & update tests by @slSeanWU in https://github.com/espnet/espnet/pull/4891
- Update some ASR results in the main readme file by @pyf98 in https://github.com/espnet/espnet/pull/4883
- Add Conv2dSubsampling1 module and test it in AphasiaBank ASR recipe by @tjysdsg in https://github.com/espnet/espnet/pull/4892
- Support x-vector extractor based on RawNet by @Takaaki-Saeki in https://github.com/espnet/espnet/pull/4884
- single language track setups by @DanBerrebbi in https://github.com/espnet/espnet/pull/4895
- fixing bug deu1 by @DanBerrebbi in https://github.com/espnet/espnet/pull/4900
- Fix dataprep issues based on updated data release via Google form by @roshansh-cmu in https://github.com/espnet/espnet/pull/4899
- Add a new EGS2 recipe 'reazonspeech' by @fujimotos in https://github.com/espnet/espnet/pull/4885
- Update version to 202301 by @kan-bayashi in https://github.com/espnet/espnet/pull/4901
New Contributors
- @DongjiGao made their first contribution in https://github.com/espnet/espnet/pull/4791
- @jerryuhoo made their first contribution in https://github.com/espnet/espnet/pull/4689
- @m-koichi made their first contribution in https://github.com/espnet/espnet/pull/4845
- @fujimotos made their first contribution in https://github.com/espnet/espnet/pull/4885
Full Changelog: https://github.com/espnet/espnet/compare/v.202211...v.202301