v.202402
Release date: 2024-02-06 11:28:17
Latest espnet/espnet release: v.202409 (2024-10-01 14:28:01)
News
We're thrilled to announce that our latest update brings two groundbreaking features to our project: espnetez and ESPnet-SPK!
New Features
- [New Features][ESPnet2][ESPnet1][Installation][SE] Add diffusion-based SE model to ESPnet-SE #5572 by @LiChenda
- [New Features][ESPnet2][ESPnet1][CI][ASR] Add Bayes Risk CTC (reworked) #5519 by @jctian98
- [New Features][ESPnet2][TTS] TTS evaluation script and monitoring functionality using MOS prediction model #5485 by @Takaaki-Saeki
- [New Features][ESPnet2][SE] Add USES model for speech enhancement in diverse conditions #5482 by @Emrys365
- [New Features][ESPnet2][CI][SID] ESPnet-SPK: major update #5408 by @Jungjee
- [New Features][ESPnet2][TTS][ASR] Add espnetez #5372 by @Masao-Someki
Enhancement
- [Enhancement][ESPnet2][OWSM] Improving OWSM inference interface #5618 by @pyf98
- [Enhancement][ESPnet2][OWSM] Add OWSM v3.1 #5611 by @pyf98
- [Enhancement][ESPnet2][CI] ESPnet-SPK: Additional models, supplement readme #5559 by @Jungjee
- [Enhancement][ESPnet2][CI][SE] Add PyTorch & GPU support for DNSMOS calculation #5548 by @Emrys365
- [Enhancement][ESPnet2][TTS][SID] Speaker embedding extractor (with ESPnet pre-trained speaker model) #5579 by @ftshijt
Recipe
- [Recipe][ESPnet2][Music] Fix relative setting of train-dev-test #5623 by @ftshijt
- [Recipe][ESPnet2][SID] ESPnet-SPK: add Voxblink recipe #5583 by @Jungjee
- [Recipe][ESPnet2][SID] ESPnet-SPK: Model upload and result generation #5558 by @Jungjee
- [Recipe][ESPnet2][Music] ACE singer recipe fixing #5551 by @ftshijt
- [Recipe][ESPnet2][TTS] TTS2 Template #5541 by @ftshijt
- [Recipe][ESPnet2][ASR] fix kaldi dependency in asr2 #5540 by @ftshijt
- [Recipe][ESPnet2][CI][S2ST] CI test for s2st #5526 by @ftshijt
- [Recipe][ESPnet2][ASR] Added data.sh to SPRING-INX IITM Recipe #5522 by @arjun-gangwar
- [Recipe][ESPnet2][ASR] Add Libriheavy small and medium ASR2 recipes #5512 by @akreal
- [Recipe][ESPnet2][ASR] SPRING-INX IITM RECIPE #5505 by @arjun-gangwar
- [Recipe][ESPnet2][ASR][RNNT] Add transducer conformer configuration to commonvoice recipe #5503 by @zuazo
- [Recipe][ESPnet2][ESPnet1] add centralized data preparation for OWSM #5478 by @jctian98
- [Recipe][ESPnet1] Added clean speech results #5649 by @linan2
- [Recipe][ESPnet2][Installation][AV] AVSR recipe for Easycom Dataset #5630 by @ms-dot-k
- [Recipe][ESPnet2] Update CHiME-7 ASR1 recipe #5555 by @popcornell
- [Recipe][ESPnet2] Add E-Branchformer model checkpoint in OWSM v2 #5517 by @pyf98
- [Recipe][ESPnet2][SLU] Slue PR configs #5087 by @siddhu001
Bugfix
- [Bugfix][ESPnet2] Fix path dependency in ESPnet tutorial #5645 by @siddhu001
- [Bugfix][ESPnet2] Fix ESPnet tutorial #5644 by @siddhu001
- [Bugfix] Fix CI #5642 by @siddhu001
- [Bugfix][ESPnet2] Fixed bug by copying missing Kaldi scripts #5636 by @VicentCano
- [Bugfix][ESPnet1][ASR] CTC prefix score, fix if blank == eos #5620 by @albertz
- [Bugfix][ESPnet2] Fix minor OWSM data prep bug #5607 by @juice500ml
- [Bugfix][ESPnet2][ESPnet1][CI] E721 #5589 by @sw005320
- [Bugfix][ESPnet2][ESPnet1] Make minlenratio effective #5581 by @jctian98
- [Bugfix][ESPnet2] Fix except #5567 by @takenori-y
- [Bugfix][ESPnet1][Installation][CI] Improve error robustness of unit tests #5535 by @Emrys365
- [Bugfix][ESPnet2][AV] Fix bug in lrs3 data preprocessing #5520 by @ms-dot-k
- [Bugfix][ESPnet1] replace old mustc links with new instructions #5516 by @brianyan918
- [Bugfix][ESPnet2][ST] Fix s2st HF model uploading #5504 by @tjysdsg
- [Bugfix][ESPnet2][ESPnet1] bug fixes for must_c v2 recipe #5640 by @jasonmusespresso
Documentation
- [Documentation][ESPnet2] Add instructions for finetuning owsm #5539 by @pyf98
- [Documentation] Updated the reference of the accepted JOSS paper #5515 by @neillu23
Others
- [Others] Update Discord Invitation Link #5578 by @Fhrozen
- [Others][ESPnet2][CI] Improve error robustness of unit tests #5523 by @Emrys365
Acknowledgements
Special thanks to @Emrys365, @Fhrozen, @Jungjee, @LiChenda, @Masao-Someki, @Takaaki-Saeki, @VicentCano, @akreal, @albertz, @arjun-gangwar, @brianyan918, @ftshijt, @jasonmusespresso, @jctian98, @juice500ml, @linan2, @ms-dot-k, @neillu23, @popcornell, @pyf98, @siddhu001, @sw005320, @takenori-y, @tjysdsg, @zuazo.