v1.8.0
版本发布时间: 2022-04-20 12:29:29
NVIDIA/NeMo最新发布版本:r2.0.0rc1(2024-08-16 05:55:14)
Known Issues
Issues
- Megatron BERT export does not currently work in the NVIDIA NGC PyTorch 22.03 container. The issue will be fixed in the NGC PyTorch 22.04 container.
- pytest for Vietnamese inverse text normalization are failing. Fixed in main
Container
For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo
docker pull nvcr.io/nvidia/nemo:22.03
ASR
Changelog
- ASR SSL Update by @sam1373 :: PR: #3714
- Polylang asr by @bmwshop :: PR: #3721
- Test grad accumulation for RNNT loss by @titu1994 :: PR: #3731
- Add readme files describing model execution flow for ASR tasks by @titu1994 :: PR: #3812
- add fr asr ckpt to doc by @yzhang123 :: PR: #3809
- Fix asr tests in 22.02 by @titu1994 :: PR: #3823
- Add new pretrained Spanish ASR models by @erastorgueva-nv :: PR: #3830
- Documentation updates for ASR by @titu1994 :: PR: #3846
- Offline VAD+ASR tutorial by @fayejf :: PR: #3828
- Added Hindi and Marathi Models in Nemo pretrained ASR_CTC_BPE models … by @meghmak13 :: PR: #3856
- Add a missing line to ASR_with_NeMo.ipynb by @lifefeel :: PR: #3908
- Multilang asr models by @bmwshop :: PR: #3907
- added stt_en_conformer_transducer_large_ls to NGC by @VahidooX :: PR: #3920
- Fix DALI test on 22.03 by @titu1994 :: PR: #3911
- Adding RNN encoder for LSTM-Transducer and LSTM-CTC models by @VahidooX :: PR: #3886
- Fix issue with Segfault in ASR models by @titu1994 :: PR: #3956
- Added Mandarin pretrained Conformer-Transducer-Large model trained on AISHELL2. by @VahidooX :: PR: #3970
TTS
Changelog
- Bump TTS deprecation version to 1.9 by @blisc :: PR: #3955
- Add pinned pynini and scipy installs to TTS training tutorial by @redoctopus :: PR: #3967
- Compatability override to load_state_dict for old TTS checkpoints by @redoctopus :: PR: #3978
NLP / NMT
Changelog
- Use worker processes for data preprocessing by @crcrpar :: PR: #3665
- Set find_unused_parameters to False in GPT example script by @ericharper :: PR: #3837
- GPT multinode eval by @ericharper :: PR: #3821
- Fix MegatronPretrainingRandomSampler by taking into account by @crcrpar :: PR: #3826
- Add slot filling into DST Generative model by @Zhilin123 :: PR: #3695
- Disable nvfuser for gpt by @ericharper :: PR: #3845
- Multi-Label Joint Intent Slot Classification by @chenrichard10 :: PR: #3742
- fix bug in intent/slot model reloading by @carolmanderson :: PR: #3874
- Make test_gpt_eval unit test less strict by @yidong72 :: PR: #3898
- Comment gpt resume ci test by @MaximumEntropy :: PR: #3901
- Neural Machine Translation with Megatron Transformer Models (Tensor Parallel and Tarred Datasets Only) by @MaximumEntropy :: PR: #3861
- Megatron support by @ramanathan831 :: PR: #3893
- Populate the GPT/BERT with uploaded models by @yidong72 :: PR: #3885
- Megatron BART by @michalivne :: PR: #3666
- Additional Japanese processor for NMT that uses MeCab segmentation. Fix for BLEU in one-many NMT by @MaximumEntropy :: PR: #3889
- NMT GRPC sever URL fix by @MaximumEntropy :: PR: #3918
- Megatron legacy conversion support by @ramanathan831 :: PR: #3919
- Update max_epochs on megatron configs by @ericharper :: PR: #3958
- Fix NMT variable passing bug by @aklife97 :: PR: #3985
- Fix nemo megatron restore with artifacts by @ericharper :: PR: #3997
- Fix megatron notebook by @ramanathan831 :: PR: #4004
- Megatron work-arounds by @borisfom :: PR: #3998
- Add T5 model P-tuning support by @yidong72 :: PR: #3768
- Make index mappings dir configurable by @ericharper :: PR: #3868
- T5 pipeline parallel by @MaximumEntropy :: PR: #3750
Text Normalization / Inverse Text Normalization
Changelog
- Tn es by @bonham79 :: PR: #3632
- Fix single GPU training issue + change deprecated Lightning args by @aklife97 :: PR: #4010
Export
Changelog
- Conformer WARs for TRT8.2 by @borisfom :: PR: #3787
- bert_module: fix inputs of export model by @virajkarandikar :: PR: #3815
- Exports 22.03 war by @borisfom :: PR: #3957
Bugfixes
Changelog
- patch librosa deprecation and fix by @fayejf :: PR: #3818
General Improvements
Changelog
- Pynini pip by @yzhang123 :: PR: #3726
- upgrade PTL trainer flags by @nithinraok :: PR: #3589
- Updated Speech Data Explorer by @vsl9 :: PR: #3710
- Fix spelling error in num_workers parameter to actually set number of dataset workers specified in yaml configs by @themikem :: PR: #3800
- Support for Camembert Huggingface bert-like models by @itzsimpl :: PR: #3799
- Update to 22.02 by @ericharper :: PR: #3771
- Fixing the defaults of conformer models in the config files by @VahidooX :: PR: #3836
- Fix T5 Encoder Mask while decoding by @MaximumEntropy :: PR: #3838
- fix: multilingual transcribe does not require lang id param by @bmwshop :: PR: #3833
- Misc improvements by @titu1994 :: PR: #3843
- Change container by @MaximumEntropy :: PR: #3844
- Making gender assignment random for cardinals, fractions, and decimal… by @bonham79 :: PR: #3759
- Jenkinsfile test changes by @chenrichard10 :: PR: #3879
- Adding a RegEx tokenizers by @michalivne :: PR: #3839
- enable bias+dropout+add fusion with nvfuser at inference by @erhoo82 :: PR: #3869
- Add text_generation_util to support TopK, TopP sampling + Tabular Data Generation. by @yidong72 :: PR: #3834
- Ptl requirements bound by @MaximumEntropy :: PR: #3903
- doc links update by @ekmb :: PR: #3891
- add citations by @yzhang123 :: PR: #3902
- Update NeMo CI to 22.03 by @MaximumEntropy :: PR: #3900
- Add domain groups to changelog builder by @titu1994 :: PR: #3904
- add input threshhold by @yzhang123 :: PR: #3913
- improvements to commonvoice data script by @bmwshop :: PR: #3892
- fixes to the cleanup flag by @bmwshop :: PR: #3921
- Upgrade to PTL 1.6.0 by @ericharper :: PR: #3890
- JSON output from diarization now includes sentences. Optimized senten… by @demsarjure :: PR: #3897
- Stateless timer fix for PTL 1.6 by @MaximumEntropy :: PR: #3925
- fix save_best missing chpt bug, update for setup_tokenizer() changes by @ekmb :: PR: #3932
- Fix tarred sentence dataset length by @MaximumEntropy :: PR: #3941
- remove old doc by @ekmb :: PR: #3946
- Fix issues with librosa deprecations by @titu1994 :: PR: #3950
- Fix notebook bugs for branch r1.8.0 by @yidong72 :: PR: #3948
- Fix global batch fit loop by @ericharper :: PR: #3936
- Refactor restorefrom by @ramanathan831 :: PR: #3927
- Fix variable name and move models to CPU in Change partition by @aklife97 :: PR: #3972
- Fix notebook error by @yidong72 :: PR: #3975
- Notebook Bug Fixes for r1.8.0 by @vadam5 :: PR: #3989
- Fix compat override for TalkNet Aligner by @redoctopus :: PR: #3993
- docs fixes by @ekmb :: PR: #3987
- Fixes val_check_interval, skip loading train data during eval by @MaximumEntropy :: PR: #3968
- LogProb calculation performance fix by @yidong72 :: PR: #3984
- Fix P-Tune T5 model by @yidong72 :: PR: #4001
- Fix the broadcast shape mismatch by @yidong72 :: PR: #4017
- Add known issues to notebook by @ericharper :: PR: #4024