v1.8.0

NVIDIA/NeMo

Release date: 2022-04-20 12:29:29

Latest NVIDIA/NeMo release: r2.0.0rc1 (2024-08-16 05:55:14)

Known Issues

Issues
  • Megatron BERT export does not currently work in the NVIDIA NGC PyTorch 22.03 container. The issue will be fixed in the NGC PyTorch 22.04 container.
  • The pytest tests for Vietnamese inverse text normalization are failing. This is fixed in main.

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.03

ASR

Changelog
  • ASR SSL Update by @sam1373 :: PR: #3714
  • Polylang asr by @bmwshop :: PR: #3721
  • Test grad accumulation for RNNT loss by @titu1994 :: PR: #3731
  • Add readme files describing model execution flow for ASR tasks by @titu1994 :: PR: #3812
  • add fr asr ckpt to doc by @yzhang123 :: PR: #3809
  • Fix asr tests in 22.02 by @titu1994 :: PR: #3823
  • Add new pretrained Spanish ASR models by @erastorgueva-nv :: PR: #3830
  • Documentation updates for ASR by @titu1994 :: PR: #3846
  • Offline VAD+ASR tutorial by @fayejf :: PR: #3828
  • Added Hindi and Marathi Models in Nemo pretrained ASR_CTC_BPE models … by @meghmak13 :: PR: #3856
  • Add a missing line to ASR_with_NeMo.ipynb by @lifefeel :: PR: #3908
  • Multilang asr models by @bmwshop :: PR: #3907
  • added stt_en_conformer_transducer_large_ls to NGC by @VahidooX :: PR: #3920
  • Fix DALI test on 22.03 by @titu1994 :: PR: #3911
  • Adding RNN encoder for LSTM-Transducer and LSTM-CTC models by @VahidooX :: PR: #3886
  • Fix issue with Segfault in ASR models by @titu1994 :: PR: #3956
  • Added Mandarin pretrained Conformer-Transducer-Large model trained on AISHELL2. by @VahidooX :: PR: #3970

TTS

Changelog
  • Bump TTS deprecation version to 1.9 by @blisc :: PR: #3955
  • Add pinned pynini and scipy installs to TTS training tutorial by @redoctopus :: PR: #3967
  • Compatibility override to load_state_dict for old TTS checkpoints by @redoctopus :: PR: #3978

NLP / NMT

Changelog
  • Use worker processes for data preprocessing by @crcrpar :: PR: #3665
  • Set find_unused_parameters to False in GPT example script by @ericharper :: PR: #3837
  • GPT multinode eval by @ericharper :: PR: #3821
  • Fix MegatronPretrainingRandomSampler by taking into account by @crcrpar :: PR: #3826
  • Add slot filling into DST Generative model by @Zhilin123 :: PR: #3695
  • Disable nvfuser for gpt by @ericharper :: PR: #3845
  • Multi-Label Joint Intent Slot Classification by @chenrichard10 :: PR: #3742
  • fix bug in intent/slot model reloading by @carolmanderson :: PR: #3874
  • Make test_gpt_eval unit test less strict by @yidong72 :: PR: #3898
  • Comment gpt resume ci test by @MaximumEntropy :: PR: #3901
  • Neural Machine Translation with Megatron Transformer Models (Tensor Parallel and Tarred Datasets Only) by @MaximumEntropy :: PR: #3861
  • Megatron support by @ramanathan831 :: PR: #3893
  • Populate the GPT/BERT with uploaded models by @yidong72 :: PR: #3885
  • Megatron BART by @michalivne :: PR: #3666
  • Additional Japanese processor for NMT that uses MeCab segmentation. Fix for BLEU in one-many NMT by @MaximumEntropy :: PR: #3889
  • NMT GRPC server URL fix by @MaximumEntropy :: PR: #3918
  • Megatron legacy conversion support by @ramanathan831 :: PR: #3919
  • Update max_epochs on megatron configs by @ericharper :: PR: #3958
  • Fix NMT variable passing bug by @aklife97 :: PR: #3985
  • Fix nemo megatron restore with artifacts by @ericharper :: PR: #3997
  • Fix megatron notebook by @ramanathan831 :: PR: #4004
  • Megatron work-arounds by @borisfom :: PR: #3998
  • Add T5 model P-tuning support by @yidong72 :: PR: #3768
  • Make index mappings dir configurable by @ericharper :: PR: #3868
  • T5 pipeline parallel by @MaximumEntropy :: PR: #3750

Text Normalization / Inverse Text Normalization

Changelog
  • Tn es by @bonham79 :: PR: #3632
  • Fix single GPU training issue + change deprecated Lightning args by @aklife97 :: PR: #4010

Export

Changelog
  • Conformer WARs for TRT8.2 by @borisfom :: PR: #3787
  • bert_module: fix inputs of export model by @virajkarandikar :: PR: #3815
  • Exports 22.03 war by @borisfom :: PR: #3957

Bugfixes

Changelog
  • patch librosa deprecation and fix by @fayejf :: PR: #3818

General Improvements

Changelog
  • Pynini pip by @yzhang123 :: PR: #3726
  • upgrade PTL trainer flags by @nithinraok :: PR: #3589
  • Updated Speech Data Explorer by @vsl9 :: PR: #3710
  • Fix spelling error in num_workers parameter to actually set number of dataset workers specified in yaml configs by @themikem :: PR: #3800
  • Support for Camembert Huggingface bert-like models by @itzsimpl :: PR: #3799
  • Update to 22.02 by @ericharper :: PR: #3771
  • Fixing the defaults of conformer models in the config files by @VahidooX :: PR: #3836
  • Fix T5 Encoder Mask while decoding by @MaximumEntropy :: PR: #3838
  • fix: multilingual transcribe does not require lang id param by @bmwshop :: PR: #3833
  • Misc improvements by @titu1994 :: PR: #3843
  • Change container by @MaximumEntropy :: PR: #3844
  • Making gender assignment random for cardinals, fractions, and decimal… by @bonham79 :: PR: #3759
  • Jenkinsfile test changes by @chenrichard10 :: PR: #3879
  • Adding a RegEx tokenizers by @michalivne :: PR: #3839
  • enable bias+dropout+add fusion with nvfuser at inference by @erhoo82 :: PR: #3869
  • Add text_generation_util to support TopK, TopP sampling + Tabular Data Generation. by @yidong72 :: PR: #3834
  • Ptl requirements bound by @MaximumEntropy :: PR: #3903
  • doc links update by @ekmb :: PR: #3891
  • add citations by @yzhang123 :: PR: #3902
  • Update NeMo CI to 22.03 by @MaximumEntropy :: PR: #3900
  • Add domain groups to changelog builder by @titu1994 :: PR: #3904
  • add input threshold by @yzhang123 :: PR: #3913
  • improvements to commonvoice data script by @bmwshop :: PR: #3892
  • fixes to the cleanup flag by @bmwshop :: PR: #3921
  • Upgrade to PTL 1.6.0 by @ericharper :: PR: #3890
  • JSON output from diarization now includes sentences. Optimized senten… by @demsarjure :: PR: #3897
  • Stateless timer fix for PTL 1.6 by @MaximumEntropy :: PR: #3925
  • fix save_best missing chpt bug, update for setup_tokenizer() changes by @ekmb :: PR: #3932
  • Fix tarred sentence dataset length by @MaximumEntropy :: PR: #3941
  • remove old doc by @ekmb :: PR: #3946
  • Fix issues with librosa deprecations by @titu1994 :: PR: #3950
  • Fix notebook bugs for branch r1.8.0 by @yidong72 :: PR: #3948
  • Fix global batch fit loop by @ericharper :: PR: #3936
  • Refactor restorefrom by @ramanathan831 :: PR: #3927
  • Fix variable name and move models to CPU in Change partition by @aklife97 :: PR: #3972
  • Fix notebook error by @yidong72 :: PR: #3975
  • Notebook Bug Fixes for r1.8.0 by @vadam5 :: PR: #3989
  • Fix compat override for TalkNet Aligner by @redoctopus :: PR: #3993
  • docs fixes by @ekmb :: PR: #3987
  • Fixes val_check_interval, skip loading train data during eval by @MaximumEntropy :: PR: #3968
  • LogProb calculation performance fix by @yidong72 :: PR: #3984
  • Fix P-Tune T5 model by @yidong72 :: PR: #4001
  • Fix the broadcast shape mismatch by @yidong72 :: PR: #4017
  • Add known issues to notebook by @ericharper :: PR: #4024
