v1.9.0

NVIDIA/NeMo

Release date: 2022-06-04 04:40:57

Latest NVIDIA/NeMo release: r2.0.0rc1 (2024-08-16 05:55:14)

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.04

ASR

Changelog
  • Fix changed function name in offline VAD ASR notebook by @fayejf :: PR: #4007
  • NeMo Adapters Support + ASR Adapters by @titu1994 :: PR: #3942
  • Update ASR configs with num_workers and pin_memory by @titu1994 :: PR: #4270
  • Verbose k2 install, skip if failed by @GNroy :: PR: #4289
  • Torch conversion for VAD-Diarization pipeline by @tango4j :: PR: #3930
  • Multiprocess improvements by @nithinraok :: PR: #4127

TTS

Changelog
  • Tn tts e by @ekmb :: PR: #3988
  • Remove AudioToCharWithPriorAndPitchDataset dependency from fastpitch by @subhankar-ghosh :: PR: #4008
  • Deprecation by @blisc :: PR: #4082
  • FastPitch FT notebook - Improving Speech Quality clarifications by @redoctopus :: PR: #3954

NLP / NMT

Changelog
  • Option to remove bias terms from Megatron transformers by @MaximumEntropy :: PR: #3973
  • Add NMT method to translate with TN/ITN pre/post-processing by @MaximumEntropy :: PR: #4009
  • Fix Punctuation and Capitalization model batching. An issue with shuffling. by @PeganovAnton :: PR: #4050
  • Fix GPT model parallel eval by @yidong72 :: PR: #4054
  • Updating with main by @jpilaul :: PR: #4073
  • Cherry-pick fix for megatron ckpt conversion script when using BCP by @ericharper :: PR: #4089
  • Check implicit grad acc in GLUE dataset building by @MaximumEntropy :: PR: #4123
  • Fix/punctuation avoid overwriting tmp files by @PeganovAnton :: PR: #4144
  • Fix/punctuation/trainer required for setting test data by @PeganovAnton :: PR: #4199
  • Raise error if bicleaner is not installed in NMT Data preprocessing notebook by @MaximumEntropy :: PR: #4264
  • Fix epoch end for NeMo NMT by @MaximumEntropy :: PR: #4265
  • Update YAML with trainer.benchmark=False for NLP by @MaximumEntropy :: PR: #4261
  • Continuous prompt refactor by @vadam5 :: PR: #3877
  • T5 finetuning for generic small text-to-text datasets by @MaximumEntropy :: PR: #4032

Text Normalization / Inverse Text Normalization

Changelog
  • Tn special text support by @yzhang123 :: PR: #3969
  • Tn update numbers by @yzhang123 :: PR: #3992
  • Tn tts e by @ekmb :: PR: #3988
  • Itn vi by @yzhang123 :: PR: #4029
  • Refactor tn data folder, and update of measure by @yzhang123 :: PR: #4028
  • Remove conda dependency for tn by @yzhang123 :: PR: #4057
  • Tn electronic by @yzhang123 :: PR: #4053
  • ThutmoseTaggerModel, a new model for inverse text normalization by @bene-ges :: PR: #4011
  • Tutorial on ITN with Thutmose tagger and small fixes by @bene-ges :: PR: #4117
  • Cleaned up TN/ITN doc by @yzhang123 :: PR: #4119
  • Update default for SH by @ekmb :: PR: #4135
  • Update ContextNet version by @titu1994 :: PR: #4207

NeMo Tools

Changelog
  • Added exception handling for audio player in SDE by @vsl9 :: PR: #4077

NeMo Core

Changelog
  • Support pre-extracted nemo checkpoint for restoration by @titu1994 :: PR: #4061
  • Fix type checking to be compatible with named tuples by @artbataev :: PR: #3986
  • Update num worker calculation due to PTL flag changes by @redoctopus :: PR: #4056
  • Refresh NeMo documentation to Sphinx Book Theme by @titu1994 :: PR: #3996
  • Generalize adapter merge strategy for future adapters by @titu1994 :: PR: #4091

General Improvements

Changelog
  • Fix Punctuation and Capitalization model batching. An issue with shuffling. by @PeganovAnton :: PR: #4050
  • Fix restoring from checkpoint for case when is provided by @PeganovAnton :: PR: #4136
  • Fix/punctuation avoid overwriting tmp files by @PeganovAnton :: PR: #4144
  • Fix/punctuation/trainer required for setting test data by @PeganovAnton :: PR: #4199
  • Ability to set log_prediction to false by @bmwshop :: PR: #3929
  • Glu activation variants by @MaximumEntropy :: PR: #3951
  • Ranking merge by @yzhang123 :: PR: #3906
  • Fix path in doc by @nithinraok :: PR: #3979
  • Adding fisher audio conversion script from old NeMo branch by @jbalam-nv :: PR: #3991
  • Improvements to get_commonvoice_data script by @bmwshop :: PR: #3999
  • Bugfix and variable name change for clustering code by @tango4j :: PR: #4023
  • Exp manager log rank 0 only arguments by @MaximumEntropy :: PR: #4026
  • Force import test on PR by @titu1994 :: PR: #4037
  • Drop support for kaldi-io by @titu1994 :: PR: #4042
  • Cherry pick HF integration and bug fixes from 1.8.1 by @ericharper :: PR: #4052
  • Make saving prompt encoder embeddings non-configurable by @vadam5 :: PR: #4071
  • Replace sampled tokens with EOD after EOD has been sampled once by @vadam5 :: PR: #4070
  • Added answer only loss for prompt learning by @vadam5 :: PR: #4069
  • Added stacking support to conformer by @VahidooX :: PR: #4045
  • Update LJSpeech whitelist file path by @redoctopus :: PR: #4078
  • Added check for microbatch calculator by @vadam5 :: PR: #4043
  • Prompt Learning Docs by @vadam5 :: PR: #4046
  • Fix link to prompt tuning page by @SeanNaren :: PR: #4081
  • Add docs for by @titu1994 :: PR: #4079
  • Dialogue task by @Zhilin123 :: PR: #3884
  • RMSNorm, Normformer and fixes from merging 1.8.0 into main by @MaximumEntropy :: PR: #4048
  • Correct link to PTL by @titu1994 :: PR: #4088
  • Added encoder and decoder modules for RETRO model by @yidong72 :: PR: #4038
  • Upgrade container to NGC PyTorch 22.04 by @ericharper :: PR: #4085
  • Tarred fix label models by @nithinraok :: PR: #4092
  • Fix link to tutorial in dialogue docs by @Zhilin123 :: PR: #4093
  • Prompt learning Notebook by @vadam5 :: PR: #4031
  • Add more papers by @yzhang123 :: PR: #4097
  • Ignore speakers with few utterances by @nithinraok :: PR: #3722
  • Access mixin by @sam1373 :: PR: #4098
  • Add CharParser for Cyrillic letters by @karpov-nick :: PR: #4101
  • Restored tests previously disabled for 22.03 base by @borisfom :: PR: #4109
  • Add augmentation to label models by @nithinraok :: PR: #4113
  • Fix register artifacts by @ramanathan831 :: PR: #4116
  • Fix typo by @yzhang123 :: PR: #4140
  • bug_fix_diarization_manifest_creation by @yzhang123 :: PR: #4125
  • Tacotron2 retrain by @treacker :: PR: #4103
  • WaveGlow input type fixes by @redoctopus :: PR: #4151
  • Notebooks' link, typo and import fix by @fayejf :: PR: #4158
  • Thutmose tagger bug fixes by @bene-ges :: PR: #4162
  • Update speaker docs by @nithinraok :: PR: #4164
  • Set plugin to None when no apex by @ekmb :: PR: #4171
  • Fix doc by @yzhang123 :: PR: #4152
  • Small import name fix by @fayejf :: PR: #4180
  • Rename folder VAD -> vad by @fayejf :: PR: #4163
  • Fix the server key value problem in the notebook by @yidong72 :: PR: #4196
  • Pin omegaconf for r1.9.0 by @ericharper :: PR: #4195
  • Fix cherrypicks by @titu1994 :: PR: #4204
  • Fix bugs for dialogue tutorial by @Zhilin123 :: PR: #4211
  • Tacotron2 1.9.0 bugfixes by @redoctopus :: PR: #4209
  • Add docs for Thutmose Tagger by @bene-ges :: PR: #4173
  • Dialogue tutorial fix by @Zhilin123 :: PR: #4221
  • Fix syntax error in ipynb-file by @bene-ges :: PR: #4228
  • Fix JSON serialization problem by @yidong72 :: PR: #4235
  • Prompt Learning Typo Fixes by @vadam5 :: PR: #4238
  • Fixing bug 3642622 by @pasandi20 :: PR: #4250
  • Fix broken link in the tutorial by @bene-ges :: PR: #4257
  • Prompt learning notebook bugfix by @vadam5 :: PR: #4262
  • Fix missing validation dataset, whitelist certain keywords for datasets by @titu1994 :: PR: #4269
  • Set Save on train end to false by @vadam5 :: PR: #4274
  • Updated config to fix CI test OOM error by @vadam5 :: PR: #4279
  • Changed total virtual prompt tokens by @vadam5 :: PR: #4295
