v1.9.0
版本发布时间: 2022-06-04 04:40:57
NVIDIA/NeMo最新发布版本:r2.0.0rc1(2024-08-16 05:55:14)
Container
For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo
docker pull nvcr.io/nvidia/nemo:22.04
ASR
Changelog
- Fix changed function name in offline vad asr notebeook by @fayejf :: PR: #4007
- NeMo Adapters Support + ASR Adapters by @titu1994 :: PR: #3942
- Update ASR configs with num_workers and pin_memory by @titu1994 :: PR: #4270
- Verbose k2 install, skip if failed by @GNroy :: PR: #4289
- Torch conversion for VAD-Diarization pipeline by @tango4j :: PR: #3930
- Multiprocess improvements by @nithinraok :: PR: #4127
TTS
Changelog
- Tn tts e by @ekmb :: PR: #3988
- Remove AudioToCharWithPriorAndPitchDataset dependency from fastpitch by @subhankar-ghosh :: PR: #4008
- Deprecation by @blisc :: PR: #4082
- FastPitch FT notebook - Improving Speech Quality clarifications by @redoctopus :: PR: #3954
NLP / NMT
Changelog
- Option to remove bias terms from Megatron transformers by @MaximumEntropy :: PR: #3973
- Add NMT method to translate with TN/ITN pre/post-processing by @MaximumEntropy :: PR: #4009
- Fix Punctuation and Capitalization model batching. An issue with shuffling. by @PeganovAnton :: PR: #4050
- Fix GPT model parallel eval by @yidong72 :: PR: #4054
- Updating with main by @jpilaul :: PR: #4073
- Cherry-pick fix for megatron ckpt conversion script when using BCP by @ericharper :: PR: #4089
- Check implicit grad acc in GLUE dataset building by @MaximumEntropy :: PR: #4123
- Fix/punctuation avoid overwritting tmp files by @PeganovAnton :: PR: #4144
- Fix/punctuation/trainer required for setting test data by @PeganovAnton :: PR: #4199
- Raise error if bicleaner is not installed in NMT Data preprocesing notebook by @MaximumEntropy :: PR: #4264
- Fix epoch end for NeMo NMT by @MaximumEntropy :: PR: #4265
- Update YAML with trainer.benchmark=False for NLP by @MaximumEntropy :: PR: #4261
- Add NMT method to translate with TN/ITN pre/post-processing by @MaximumEntropy :: PR: #4009
- Continuous prompt refactor by @vadam5 :: PR: #3877
- T5 finetuning for generic small text-to-text datasets by @MaximumEntropy :: PR: #4032
Text Normalization / Inverse Text Normalization
Changelog
- Tn special text support by @yzhang123 :: PR: #3969
- Tn update numbers by @yzhang123 :: PR: #3992
- Tn tts e by @ekmb :: PR: #3988
- Itn vi by @yzhang123 :: PR: #4029
- Refactor tn data folder, and update of measure by @yzhang123 :: PR: #4028
- Remove conda dependency for tn by @yzhang123 :: PR: #4057
- Tn electronic by @yzhang123 :: PR: #4053
- ThutmoseTaggerModel, a new model for inverse text normalization by @bene-ges :: PR: #4011
- Tutorial on ITN with Thutmose tagger and small fixes by @bene-ges :: PR: #4117
- Cleaned up TN/ ITN doc by @yzhang123 :: PR: #4119
- Update default for SH by @ekmb :: PR: #4135
- Update ContextNet version by @titu1994 :: PR: #4207
NeMo Tools
Changelog
- Added exception handling for audio player in SDE by @vsl9 :: PR: #4077
NeMo Core
Changelog
- Support pre-extracted nemo checkpoint for restoration by @titu1994 :: PR: #4061
- Fix type checking to be compatible with named tuples by @artbataev :: PR: #3986
- Update num worker calculation due to PTL flag changes by @redoctopus :: PR: #4056
- Refresh NeMo documentation to Sphinx Book Theme by @titu1994 :: PR: #3996
- Generalize adapter merge strategy for future adapters by @titu1994 :: PR: #4091
General Improvements
Changelog
- Fix Punctuation and Capitalization model batching. An issue with shuffling. by @PeganovAnton :: PR: #4050
- Fix restoring from checkpoint for case when is provided by @PeganovAnton :: PR: #4136
- Fix/punctuation avoid overwritting tmp files by @PeganovAnton :: PR: #4144
- Fix/punctuation/trainer required for setting test data by @PeganovAnton :: PR: #4199
- Ability to set log_prediction to false by @bmwshop :: PR: #3929
- Glu activation variants by @MaximumEntropy :: PR: #3951
- Ranking merge by @yzhang123 :: PR: #3906
- Fix path in doc by @nithinraok :: PR: #3979
- Adding fisher audio conversion script from old NeMo branch by @jbalam-nv :: PR: #3991
- improvements to geet_commonvoice_data script by @bmwshop :: PR: #3999
- Bugfix and variable name change for clustering code by @tango4j :: PR: #4023
- Exp manager log rank 0 only arguments by @MaximumEntropy :: PR: #4026
- Force import test on PR by @titu1994 :: PR: #4037
- Drop support for kaldi-io by @titu1994 :: PR: #4042
- Cherry pick HF integration and bug fixes from 1.8.1 by @ericharper :: PR: #4052
- Make saving prompt encoder embeddings non-configurable by @vadam5 :: PR: #4071
- Replace sampled tokens with EOD after EOD has been sampled once by @vadam5 :: PR: #4070
- Added answer only loss for prompt learning by @vadam5 :: PR: #4069
- added stacking suport to conformer. by @VahidooX :: PR: #4045
- Update LJSpeech whitelist file path by @redoctopus :: PR: #4078
- Added check for microbatch calculator by @vadam5 :: PR: #4043
- Prompt Learning Docs by @vadam5 :: PR: #4046
- Fix link to prompt tuning page by @SeanNaren :: PR: #4081
- Add docs for by @titu1994 :: PR: #4079
- Dialogue task by @Zhilin123 :: PR: #3884
- RMSNorm, Normformer and fixes from merging 1.8.0 into main by @MaximumEntropy :: PR: #4048
- Correct link to PTL by @titu1994 :: PR: #4088
- Added encoder and decoder modules for RETRO model by @yidong72 :: PR: #4038
- Upgrade container to NGC PyTorch 22.04 by @ericharper :: PR: #4085
- Tarred fix label models by @nithinraok :: PR: #4092
- Fix link to tutorial in dialogue docs by @Zhilin123 :: PR: #4093
- Prompt learning Notebook by @vadam5 :: PR: #4031
- Add more papers by @yzhang123 :: PR: #4097
- Ignore speakers with few utterances by @nithinraok :: PR: #3722
- Access mixin by @sam1373 :: PR: #4098
- Add CharParser for Cyrillic letters by @karpov-nick :: PR: #4101
- Restored tests previously disabled for 22.03 base by @borisfom :: PR: #4109
- Add augmentation to label models by @nithinraok :: PR: #4113
- Fix register artifacts by @ramanathan831 :: PR: #4116
- Fix typo by @yzhang123 :: PR: #4140
- bug_fix_diarization_manifest_creation by @yzhang123 :: PR: #4125
- Tacotron2 retrain by @treacker :: PR: #4103
- WaveGlow input type fixes by @redoctopus :: PR: #4151
- Notebooks' link, typo and import fix by @fayejf :: PR: #4158
- Thutmose tagger bug fixes by @bene-ges :: PR: #4162
- Update speaker docs by @nithinraok :: PR: #4164
- Set plugin to None when no apex by @ekmb :: PR: #4171
- Fix doc by @yzhang123 :: PR: #4152
- Small import name fix by @fayejf :: PR: #4180
- Rename folder VAD -> vad by @fayejf :: PR: #4163
- Fix the server key value problem in the notebook by @yidong72 :: PR: #4196
- Pin omegaconf for r1.9.0 by @ericharper :: PR: #4195
- Fix cherrypicks by @titu1994 :: PR: #4204
- Fix bugs for dialogue tutorial by @Zhilin123 :: PR: #4211
- Tacotron2 1.9.0 bugfixes by @redoctopus :: PR: #4209
- Add docs for Thutmose Tagger by @bene-ges :: PR: #4173
- Dialogue tutorial fix by @Zhilin123 :: PR: #4221
- Fix syntax error in ipynb-file by @bene-ges :: PR: #4228
- Fix JSON serialization problem by @yidong72 :: PR: #4235
- Prompt Learning Typo Fixes by @vadam5 :: PR: #4238
- Fixing bug 3642622 by @pasandi20 :: PR: #4250
- Fix broken link in the tutorial by @bene-ges :: PR: #4257
- Prompt learning notebook bugfix by @vadam5 :: PR: #4262
- Fix missing validation dataset, whitelist certain keywords for datasets by @titu1994 :: PR: #4269
- Set Save on train end to false by @vadam5 :: PR: #4274
- Updated config to fix CI test OOM error by @vadam5 :: PR: #4279
- Changed total virtual prompt tokens by @vadam5 :: PR: #4295