v1.9.0

Release date: 2022-06-04 04:40:57

Latest NVIDIA/NeMo release: r2.0.0rc1 (2024-08-16 05:55:14)

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:22.04

ASR

Changelog

Fix changed function name in offline VAD ASR notebook by @fayejf :: PR: #4007
NeMo Adapters Support + ASR Adapters by @titu1994 :: PR: #3942
Update ASR configs with num_workers and pin_memory by @titu1994 :: PR: #4270
Verbose k2 install, skip if failed by @GNroy :: PR: #4289
Torch conversion for VAD-Diarization pipeline by @tango4j :: PR: #3930
Multiprocess improvements by @nithinraok :: PR: #4127

TTS

Changelog

Tn tts e by @ekmb :: PR: #3988
Remove AudioToCharWithPriorAndPitchDataset dependency from fastpitch by @subhankar-ghosh :: PR: #4008
Deprecation by @blisc :: PR: #4082
FastPitch FT notebook - Improving Speech Quality clarifications by @redoctopus :: PR: #3954

NLP / NMT

Changelog

Option to remove bias terms from Megatron transformers by @MaximumEntropy :: PR: #3973
Add NMT method to translate with TN/ITN pre/post-processing by @MaximumEntropy :: PR: #4009
Fix Punctuation and Capitalization model batching. An issue with shuffling. by @PeganovAnton :: PR: #4050
Fix GPT model parallel eval by @yidong72 :: PR: #4054
Updating with main by @jpilaul :: PR: #4073
Cherry-pick fix for megatron ckpt conversion script when using BCP by @ericharper :: PR: #4089
Check implicit grad acc in GLUE dataset building by @MaximumEntropy :: PR: #4123
Fix/punctuation avoid overwriting tmp files by @PeganovAnton :: PR: #4144
Fix/punctuation/trainer required for setting test data by @PeganovAnton :: PR: #4199
Raise error if bicleaner is not installed in NMT Data preprocessing notebook by @MaximumEntropy :: PR: #4264
Fix epoch end for NeMo NMT by @MaximumEntropy :: PR: #4265
Update YAML with trainer.benchmark=False for NLP by @MaximumEntropy :: PR: #4261
Continuous prompt refactor by @vadam5 :: PR: #3877
T5 finetuning for generic small text-to-text datasets by @MaximumEntropy :: PR: #4032

Text Normalization / Inverse Text Normalization

Changelog

Tn special text support by @yzhang123 :: PR: #3969
Tn update numbers by @yzhang123 :: PR: #3992
Tn tts e by @ekmb :: PR: #3988
Itn vi by @yzhang123 :: PR: #4029
Refactor tn data folder, and update of measure by @yzhang123 :: PR: #4028
Remove conda dependency for tn by @yzhang123 :: PR: #4057
Tn electronic by @yzhang123 :: PR: #4053
ThutmoseTaggerModel, a new model for inverse text normalization by @bene-ges :: PR: #4011
Tutorial on ITN with Thutmose tagger and small fixes by @bene-ges :: PR: #4117
Cleaned up TN/ITN doc by @yzhang123 :: PR: #4119
Update default for SH by @ekmb :: PR: #4135
Update ContextNet version by @titu1994 :: PR: #4207

NeMo Tools

Changelog

Added exception handling for audio player in SDE by @vsl9 :: PR: #4077

NeMo Core

Changelog

Support pre-extracted nemo checkpoint for restoration by @titu1994 :: PR: #4061
Fix type checking to be compatible with named tuples by @artbataev :: PR: #3986
Update num worker calculation due to PTL flag changes by @redoctopus :: PR: #4056
Refresh NeMo documentation to Sphinx Book Theme by @titu1994 :: PR: #3996
Generalize adapter merge strategy for future adapters by @titu1994 :: PR: #4091

General Improvements

Changelog

Fix Punctuation and Capitalization model batching. An issue with shuffling. by @PeganovAnton :: PR: #4050
Fix restoring from checkpoint for case when is provided by @PeganovAnton :: PR: #4136
Fix/punctuation avoid overwriting tmp files by @PeganovAnton :: PR: #4144
Fix/punctuation/trainer required for setting test data by @PeganovAnton :: PR: #4199
Ability to set log_prediction to false by @bmwshop :: PR: #3929
Glu activation variants by @MaximumEntropy :: PR: #3951
Ranking merge by @yzhang123 :: PR: #3906
Fix path in doc by @nithinraok :: PR: #3979
Adding fisher audio conversion script from old NeMo branch by @jbalam-nv :: PR: #3991
Improvements to get_commonvoice_data script by @bmwshop :: PR: #3999
Bugfix and variable name change for clustering code by @tango4j :: PR: #4023
Exp manager log rank 0 only arguments by @MaximumEntropy :: PR: #4026
Force import test on PR by @titu1994 :: PR: #4037
Drop support for kaldi-io by @titu1994 :: PR: #4042
Cherry pick HF integration and bug fixes from 1.8.1 by @ericharper :: PR: #4052
Make saving prompt encoder embeddings non-configurable by @vadam5 :: PR: #4071
Replace sampled tokens with EOD after EOD has been sampled once by @vadam5 :: PR: #4070
Added answer only loss for prompt learning by @vadam5 :: PR: #4069
Added stacking support to Conformer by @VahidooX :: PR: #4045
Update LJSpeech whitelist file path by @redoctopus :: PR: #4078
Added check for microbatch calculator by @vadam5 :: PR: #4043
Prompt Learning Docs by @vadam5 :: PR: #4046
Fix link to prompt tuning page by @SeanNaren :: PR: #4081
Add docs for by @titu1994 :: PR: #4079
Dialogue task by @Zhilin123 :: PR: #3884
RMSNorm, Normformer and fixes from merging 1.8.0 into main by @MaximumEntropy :: PR: #4048
Correct link to PTL by @titu1994 :: PR: #4088
Added encoder and decoder modules for RETRO model by @yidong72 :: PR: #4038
Upgrade container to NGC PyTorch 22.04 by @ericharper :: PR: #4085
Tarred fix label models by @nithinraok :: PR: #4092
Fix link to tutorial in dialogue docs by @Zhilin123 :: PR: #4093
Prompt learning Notebook by @vadam5 :: PR: #4031
Add more papers by @yzhang123 :: PR: #4097
Ignore speakers with few utterances by @nithinraok :: PR: #3722
Access mixin by @sam1373 :: PR: #4098
Add CharParser for Cyrillic letters by @karpov-nick :: PR: #4101
Restored tests previously disabled for 22.03 base by @borisfom :: PR: #4109
Add augmentation to label models by @nithinraok :: PR: #4113
Fix register artifacts by @ramanathan831 :: PR: #4116
Fix typo by @yzhang123 :: PR: #4140
bug_fix_diarization_manifest_creation by @yzhang123 :: PR: #4125
Tacotron2 retrain by @treacker :: PR: #4103
WaveGlow input type fixes by @redoctopus :: PR: #4151
Notebooks' link, typo and import fix by @fayejf :: PR: #4158
Thutmose tagger bug fixes by @bene-ges :: PR: #4162
Update speaker docs by @nithinraok :: PR: #4164
Set plugin to None when no apex by @ekmb :: PR: #4171
Fix doc by @yzhang123 :: PR: #4152
Small import name fix by @fayejf :: PR: #4180
Rename folder VAD -> vad by @fayejf :: PR: #4163
Fix the server key value problem in the notebook by @yidong72 :: PR: #4196
Pin omegaconf for r1.9.0 by @ericharper :: PR: #4195
Fix cherrypicks by @titu1994 :: PR: #4204
Fix bugs for dialogue tutorial by @Zhilin123 :: PR: #4211
Tacotron2 1.9.0 bugfixes by @redoctopus :: PR: #4209
Add docs for Thutmose Tagger by @bene-ges :: PR: #4173
Dialogue tutorial fix by @Zhilin123 :: PR: #4221
Fix syntax error in ipynb-file by @bene-ges :: PR: #4228
Fix JSON serialization problem by @yidong72 :: PR: #4235
Prompt Learning Typo Fixes by @vadam5 :: PR: #4238
Fixing bug 3642622 by @pasandi20 :: PR: #4250
Fix broken link in the tutorial by @bene-ges :: PR: #4257
Prompt learning notebook bugfix by @vadam5 :: PR: #4262
Fix missing validation dataset, whitelist certain keywords for datasets by @titu1994 :: PR: #4269
Set Save on train end to false by @vadam5 :: PR: #4274
Updated config to fix CI test OOM error by @vadam5 :: PR: #4279
Changed total virtual prompt tokens by @vadam5 :: PR: #4295
