v1.8.0

版本发布时间: 2022-04-20 12:29:29

NVIDIA/NeMo最新发布版本:r2.0.0rc1(2024-08-16 05:55:14)

Known Issues

Issues

Megatron BERT export does not currently work in the NVIDIA NGC PyTorch 22.03 container. The issue will be fixed in the NGC PyTorch 22.04 container.
pytest for Vietnamese inverse text normalization are failing. Fixed in main

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.03

ASR

Changelog

ASR SSL Update by @sam1373 :: PR: #3714
Polylang asr by @bmwshop :: PR: #3721
Test grad accumulation for RNNT loss by @titu1994 :: PR: #3731
Add readme files describing model execution flow for ASR tasks by @titu1994 :: PR: #3812
add fr asr ckpt to doc by @yzhang123 :: PR: #3809
Fix asr tests in 22.02 by @titu1994 :: PR: #3823
Add new pretrained Spanish ASR models by @erastorgueva-nv :: PR: #3830
Documentation updates for ASR by @titu1994 :: PR: #3846
Offline VAD+ASR tutorial by @fayejf :: PR: #3828
Added Hindi and Marathi Models in Nemo pretrained ASR_CTC_BPE models … by @meghmak13 :: PR: #3856
Add a missing line to ASR_with_NeMo.ipynb by @lifefeel :: PR: #3908
Multilang asr models by @bmwshop :: PR: #3907
added stt_en_conformer_transducer_large_ls to NGC by @VahidooX :: PR: #3920
Fix DALI test on 22.03 by @titu1994 :: PR: #3911
Adding RNN encoder for LSTM-Transducer and LSTM-CTC models by @VahidooX :: PR: #3886
Fix issue with Segfault in ASR models by @titu1994 :: PR: #3956
Added Mandarin pretrained Conformer-Transducer-Large model trained on AISHELL2. by @VahidooX :: PR: #3970

TTS

Changelog

Bump TTS deprecation version to 1.9 by @blisc :: PR: #3955
Add pinned pynini and scipy installs to TTS training tutorial by @redoctopus :: PR: #3967
Compatability override to load_state_dict for old TTS checkpoints by @redoctopus :: PR: #3978

NLP / NMT

Changelog

Use worker processes for data preprocessing by @crcrpar :: PR: #3665
Set find_unused_parameters to False in GPT example script by @ericharper :: PR: #3837
GPT multinode eval by @ericharper :: PR: #3821
Fix MegatronPretrainingRandomSampler by taking into account by @crcrpar :: PR: #3826
Add slot filling into DST Generative model by @Zhilin123 :: PR: #3695
Disable nvfuser for gpt by @ericharper :: PR: #3845
Multi-Label Joint Intent Slot Classification by @chenrichard10 :: PR: #3742
fix bug in intent/slot model reloading by @carolmanderson :: PR: #3874
Make test_gpt_eval unit test less strict by @yidong72 :: PR: #3898
Comment gpt resume ci test by @MaximumEntropy :: PR: #3901
Neural Machine Translation with Megatron Transformer Models (Tensor Parallel and Tarred Datasets Only) by @MaximumEntropy :: PR: #3861
Megatron support by @ramanathan831 :: PR: #3893
Populate the GPT/BERT with uploaded models by @yidong72 :: PR: #3885
Megatron BART by @michalivne :: PR: #3666
Additional Japanese processor for NMT that uses MeCab segmentation. Fix for BLEU in one-many NMT by @MaximumEntropy :: PR: #3889
NMT GRPC sever URL fix by @MaximumEntropy :: PR: #3918
Megatron legacy conversion support by @ramanathan831 :: PR: #3919
Update max_epochs on megatron configs by @ericharper :: PR: #3958
Fix NMT variable passing bug by @aklife97 :: PR: #3985
Fix nemo megatron restore with artifacts by @ericharper :: PR: #3997
Fix megatron notebook by @ramanathan831 :: PR: #4004
Megatron work-arounds by @borisfom :: PR: #3998
Add T5 model P-tuning support by @yidong72 :: PR: #3768
Make index mappings dir configurable by @ericharper :: PR: #3868
T5 pipeline parallel by @MaximumEntropy :: PR: #3750

Text Normalization / Inverse Text Normalization

Changelog

Tn es by @bonham79 :: PR: #3632
Fix single GPU training issue + change deprecated Lightning args by @aklife97 :: PR: #4010

Export

Changelog

Conformer WARs for TRT8.2 by @borisfom :: PR: #3787
bert_module: fix inputs of export model by @virajkarandikar :: PR: #3815
Exports 22.03 war by @borisfom :: PR: #3957

Bugfixes

Changelog

patch librosa deprecation and fix by @fayejf :: PR: #3818

General Improvements

Changelog

Pynini pip by @yzhang123 :: PR: #3726
upgrade PTL trainer flags by @nithinraok :: PR: #3589
Updated Speech Data Explorer by @vsl9 :: PR: #3710
Fix spelling error in num_workers parameter to actually set number of dataset workers specified in yaml configs by @themikem :: PR: #3800
Support for Camembert Huggingface bert-like models by @itzsimpl :: PR: #3799
Update to 22.02 by @ericharper :: PR: #3771
Fixing the defaults of conformer models in the config files by @VahidooX :: PR: #3836
Fix T5 Encoder Mask while decoding by @MaximumEntropy :: PR: #3838
fix: multilingual transcribe does not require lang id param by @bmwshop :: PR: #3833
Misc improvements by @titu1994 :: PR: #3843
Change container by @MaximumEntropy :: PR: #3844
Making gender assignment random for cardinals, fractions, and decimal… by @bonham79 :: PR: #3759
Jenkinsfile test changes by @chenrichard10 :: PR: #3879
Adding a RegEx tokenizers by @michalivne :: PR: #3839
enable bias+dropout+add fusion with nvfuser at inference by @erhoo82 :: PR: #3869
Add text_generation_util to support TopK, TopP sampling + Tabular Data Generation. by @yidong72 :: PR: #3834
Ptl requirements bound by @MaximumEntropy :: PR: #3903
doc links update by @ekmb :: PR: #3891
add citations by @yzhang123 :: PR: #3902
Update NeMo CI to 22.03 by @MaximumEntropy :: PR: #3900
Add domain groups to changelog builder by @titu1994 :: PR: #3904
add input threshhold by @yzhang123 :: PR: #3913
improvements to commonvoice data script by @bmwshop :: PR: #3892
fixes to the cleanup flag by @bmwshop :: PR: #3921
Upgrade to PTL 1.6.0 by @ericharper :: PR: #3890
JSON output from diarization now includes sentences. Optimized senten… by @demsarjure :: PR: #3897
Stateless timer fix for PTL 1.6 by @MaximumEntropy :: PR: #3925
fix save_best missing chpt bug, update for setup_tokenizer() changes by @ekmb :: PR: #3932
Fix tarred sentence dataset length by @MaximumEntropy :: PR: #3941
remove old doc by @ekmb :: PR: #3946
Fix issues with librosa deprecations by @titu1994 :: PR: #3950
Fix notebook bugs for branch r1.8.0 by @yidong72 :: PR: #3948
Fix global batch fit loop by @ericharper :: PR: #3936
Refactor restorefrom by @ramanathan831 :: PR: #3927
Fix variable name and move models to CPU in Change partition by @aklife97 :: PR: #3972
Fix notebook error by @yidong72 :: PR: #3975
Notebook Bug Fixes for r1.8.0 by @vadam5 :: PR: #3989
Fix compat override for TalkNet Aligner by @redoctopus :: PR: #3993
docs fixes by @ekmb :: PR: #3987
Fixes val_check_interval, skip loading train data during eval by @MaximumEntropy :: PR: #3968
LogProb calculation performance fix by @yidong72 :: PR: #3984
Fix P-Tune T5 model by @yidong72 :: PR: #4001
Fix the broadcast shape mismatch by @yidong72 :: PR: #4017
Add known issues to notebook by @ericharper :: PR: #4024

相关地址：原始地址下载(tar) 下载(zip)

查看：2022-04-20发行的版本