v.202409
版本发布时间: 2024-10-01 14:28:01
espnet/espnet最新发布版本:v.202409(2024-10-01 14:28:01)
New Features
- [New Features][ESPnet2][TTS][Codec] Support Codec feature for TTS2 task #5857 by @wyh2000
- [New Features][ESPnet2][Codec] Codec downstream task support: TTS #5763 by @jctian98
- [New Features][ESPnet2][Codec] Add Encodec features for Codec toolkit #5758 by @jctian98
- [New Features][ESPnet2][Installation][TTS] Add evaluation scripts with DiscreteSpeechMetrics. #5661 by @Takaaki-Saeki
- [New Features][ESPnet2][ASR] Integrate adapter for s3prl frontend #5609 by @Stanwang1210
- [New Features][ESPnet2][CI][OWSM] Support external dataset library for ESPnetEasy #5584 by @Masao-Someki
- [New Features][ESPnet2][CI][LM] Pr voxtlm #5472 by @soumimaiti
Enhancement
- [Enhancement][ESPnet2][SLM] MT Task in SpeechLM #5899 by @ftshijt
- [Enhancement][ESPnet2][Codec] Categorical Balnced Chunk iterator #5894 by @ftshijt
- [Enhancement][ESPnet2][ESPnet1] TransformerDecoder forward_one_step with memory_mask #5679 by @albertz
- [Enhancement][ESPnet2] Update espnet_model.py #5646 by @shen9712
Recipe
- [Recipe][ESPnet2][Music] Fixed KiSing Data Preparation #5895 by @HANJionghao
- [Recipe][ESPnet2][ASR] CORAAL asr1 recipe #5882 by @kalvinchang
- [Recipe][ESPnet2][ASR] ml_superb asr2 recipe #5866 by @Stanwang1210
- [Recipe][ESPnet2] Add more download links for ML-SUPERB #5863 by @ftshijt
- [Recipe][ESPnet2][ASR] Fix bug in asr2.sh #5859 by @juice500ml
- [Recipe][ESPnet2][Music] fix bugs in SVS1 #5851 by @South-Twilight
- [Recipe][ESPnet2][TTS] New Recipe of tts2+aishell3 #5849 by @Tsukasane
- [Recipe][ESPnet2][ASR] Espnet Multi-convformer implementation #5832 by @Darshan7575
- [Recipe][ESPnet2][SE] Update of SE functions #5825 by @Emrys365
- [Recipe][ESPnet2] SPRING-INX Recipe (Speech Lab, IIT, Madras) #5811 by @arjun-gangwar
- [Recipe][ESPnet2][TTS] Adding Hifitts recipe for espnet #5784 by @coding-phoenix-12
- [Recipe][ESPnet2][ASR] Updated results for CHiME-8 DASR baseline with new notsofar1 dev set #5771 by @popcornell
- [Recipe][ESPnet2][SE] Final model scores for TF-GridNetV2 on the Kinect-WSJ dataset #5754 by @atharva253
- [Recipe][ESPnet2] Apply normalization on validation set for CHiME-8 recipe #5749 by @popcornell
- [Recipe][ESPnet2][Need review][Codec] ESPnet-Codec decoding and Scoring #5747 by @ftshijt
- [Recipe][ESPnet2][CI][ST] Add recipe for IWSLT 2024 shared task Indic track #5744 by @cromz22
- [Recipe][ESPnet2][Music] [SVS] VISinger Plus #5741 by @jerryuhoo
- [Recipe][ESPnet2][Need review][Codec] ESPnet-codec Training and Setup #5732 by @ftshijt
- [Recipe][ESPnet2][ASR] ESPnet Recipe for ASR on the Makerere Radio Speech Corpus #5730 by @satvik-dixit
- [Recipe][ESPnet2][SE] ESPnet recipe for the Kinect-WSJ dataset #5711 by @atharva253
- [Recipe][ESPnet2][TTS][ASR][Music] Update bitrate calculation scripts for the IS24 discrete speech challenge #5677 by @ftshijt
- [Recipe][ESPnet2][ASR] Add some documents for JTubeSpeech #5663 by @sw005320
- [Recipe][ESPnet2][SID] ESPnet-SPK: add SdSV 2021 recipe #5659 by @Alexgichamba
- [Recipe][ESPnet2][ASR] Add E-Branchformer model for FLEURS #5657 by @wanchichen
- [Recipe][ESPnet2][Installation][CI][ASR] CHiME-8 DASR recipe based on CHiME-7 DASR baseline #5641 by @popcornell
- [Recipe][ESPnet2][ASR] add interspeech2024_dsu_challenge/asr2 #5627 by @simpleoier
- [Recipe][ESPnet2][Installation][TTS] Discrete token-based TTS implementation #5626 by @ftshijt
Bugfix
- [Bugfix] fix: replace ellipses (...) in ESPnet-EZ Trainer documentation #5911 by @kalvinchang
- [Bugfix] Bugfix/homepage #5885 by @Masao-Someki
- [Bugfix][ESPnet2] Fix absolute paths in aishell3_tts2 #5884 by @Tsukasane
- [Bugfix] Bug fix for source link #5883 by @Masao-Someki
- [Bugfix][Installation] [CI] Add required file for g2p_en #5869 by @Fhrozen
- [Bugfix][ESPnet2] A fix to newer torch version (compatible to old version with typecheck) #5830 by @ftshijt
- [Bugfix][ESPnet2] Revert change to abs_task to keep the consistency behavior #5789 by @ftshijt
- [Bugfix][ESPnet2] Fix Whisper frontend #5760 by @siddhu001
- [Bugfix][ESPnet2][SE] Update TSE recipe egs2/librimix/tse1 #5731 by @Emrys365
- [Bugfix][ESPnet2] Fix LoRA issues when saving all parameters. #5722 by @simpleoier
- [Bugfix][ESPnet2] Fix tts packing with new spk embedding #5715 by @ftshijt
- [Bugfix][ESPnet2][TTS] Fix stage references in generated run.sh in TTS recipes #5714 by @G-Thor
- [Bugfix][ESPnet2][OWSM] fix a small issue in OWSM decode_long #5703 by @jctian98
- [Bugfix][ESPnet2][Installation] Upgrade typeguard #5702 by @sw005320
- [Bugfix][ESPnet2] Quick fix to calculation of bitrate #5692 by @ftshijt
- [Bugfix][ESPnet2][SSUM] Fix typo in summarization scoring #5688 by @YoshikiMas
- [Bugfix][ESPnet2] Update egs2/TEMPLATE/asr2/asr2.sh #5682 by @simpleoier
- [Bugfix][ESPnet2][ASR] Fix over-lengthy audio in ml_superb data prep #5678 by @ftshijt
- [Bugfix][ESPnet2] fix typo #5673 by @hiranoyu0830
- [Bugfix][Installation][ST] Fix CI Multilingual ST test #5672 by @Fhrozen
- [Bugfix][ESPnet2][SLU] Fix speed perturbation when not using transcript in slu.sh #5671 by @siddhu001
- [Bugfix][ESPnet2][SLU] Fix loading pre-trained model from transformers #5668 by @siddhu001
- [Bugfix][ESPnet2] Correct the argument errors in the whisper tokenizer language. #5666 by @pengchengguo
Documentation
- [Documentation][ESPnet2][Music] Fixed SingingGenerate docstring examples #5889 by @HANJionghao
- [Documentation][ESPnet2][CI] Separate packing and uploading stages #5752 by @cromz22
- [Documentation] Add script to make release note from milestone #5653 by @kan-bayashi
Refactoring
- [Refactoring] Modified easy to ez #5719 by @Masao-Someki
Others
- [Others][CI] Bugfix for the paper publish workflow #5909 by @juice500ml
- [Others][ESPnet2] Revision on Speechlm vocabulary extension script #5906 by @jctian98
- [Others][ESPnet2][TTS] Fix tts.sh path in aishell3 tts2 #5879 by @sw005320
- [Others][ESPnet2][Installation] Add DeepSpeed trainer for large-scale training #5856 by @jctian98
- [Others] Update README info #5852 by @ftshijt
- [Others][ESPnet2][ESPnet1][Installation] Add flash-attn #5839 by @wanchichen
- [Others][ESPnet2][Music] [SVS] fix VISinger2 typecheck error #5838 by @jerryuhoo
- [Others][ESPnet2] Fixed kising/acesinger google drive download #5834 by @HANJionghao
- [Others][ESPnet2][SID] update MFA-Conformer performance after fixing the bug in #5797 #5826 by @Jungjee
- [Others][ESPnet2][CI][SE] SE function updates: new models and support for handling various sampling frequencies #5800 by @Emrys365
- [Others][ESPnet2][SID] fix spk mfa-conformer forwarding #5797 by @series2
- [Others][ESPnet2][CI][Music] [SVS] Add CI tests for VISinger Plus #5786 by @jerryuhoo
- [Others][ESPnet2][LM] Bug fix for VoxtLM v1 recipe #5782 by @cromz22
- [Others][ESPnet2][ESPnet1] Added partially auto-regressive decoding #5769 by @Masao-Someki
- [Others][Installation][CI] Fix minor issue in anaconda downloading #5753 by @ftshijt
- [Others] [pre-commit.ci] pre-commit autoupdate #5738 by @pre-commit-ci[bot]
- [Others][ESPnet2][Installation][CI] Upgrade typeguard [Subst.] #5724 by @Fhrozen
- [Others][ESPnet2][SE] TF-GridNet training recipe for DNS Interspeech 2020 dataset #5710 by @nateanl
- [Others][ESPnet2][LM] Adding transformer_opt #5709 by @soumimaiti
- [Others][ESPnet2] Add Readme for Voxtlm #5693 by @wyh2000
- [Others][ESPnet2][SID] ESPnet-SPK: add ASVspoof19 SASV recipe #5687 by @Alexgichamba
Acknowledgements
Special thanks to @Alexgichamba, @Darshan7575, @Emrys365, @Fhrozen, @G-Thor, @HANJionghao, @Jungjee, @Masao-Someki, @South-Twilight, @Stanwang1210, @Takaaki-Saeki, @Tsukasane, @YoshikiMas, @albertz, @arjun-gangwar, @atharva253, @coding-phoenix-12, @cromz22, @ftshijt, @hiranoyu0830, @jctian98, @jerryuhoo, @juice500ml, @kalvinchang, @kan-bayashi, @nateanl, @pengchengguo, @popcornell, @pre-commit-ci[bot], @satvik-dixit, @series2, @shen9712, @siddhu001, @simpleoier, @soumimaiti, @sw005320, @wanchichen, @wyh2000.