v1.19.0

NVIDIA/NeMo

版本发布时间: 2023-06-16 07:46:05

NVIDIA/NeMo最新发布版本:r2.0.0rc1(2024-08-16 05:55:14)

Highlights

NeMo ASR

Sharded Manifests for Tarred Datasets #6395
Frame-VAD model + datasets support #6441
Noise Norm Perturbation #6445
Code Switched Dataset with IID Sampling #6448

NeMo TTS

Speaker adaptation for FastPitch #6416, #6417

NeMo Megatron

Batch size rampup #6424
Unify dataset and model classes for all PEFT #6391
LoRA for GPT #6391
Convert interleaved pipeline model to non-interleaved #6498
Dialog Dataset for SFT #6654
Dynamic length batches for GPT SFT #6510
Merge LoRA weights into base model #6597

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:23.04

Detailed Changelogs

ASR

Changelog

Sharded manifests for tarred datasets by @bmwshop :: PR: #6395
Update script for ngram rnnt and hat beam search decoding by @andrusenkoau :: PR: #6370
Add disclaimer about dataset for ASR by @titu1994 :: PR: #6496
New noise_norm perturbation based on Riva work by @trias702 :: PR: #6445
Add Frame-VAD model and datasets by @stevehuang52 :: PR: #6441
removing unnecessary avoid_bfloat16_autocast_context by @bmwshop :: PR: #6481
FC models in menu by @bmwshop :: PR: #6473
Separate punctuation by whitespace by @karpnv :: PR: #6574
Cherry pick commits in #6601 to main by @fayejf :: PR: #6611
Offline and streaming inference support for hybrid model by @fayejf :: PR: #6570
Disable interctc tests by @Kipok :: PR: #6638
ASR-TTS Models: Support hybrid RNNT-CTC, improve docs. by @artbataev :: PR: #6620
Confidence ensembles implementation by @Kipok :: PR: #6614
Confidence ensembles: fix issues and add tuning functionality by @Kipok :: PR: #6657
Add support for RNNT/hybrid models to partial transcribe by @stevehuang52 :: PR: #6609
eval_beamsearch_ngram.py with hybrid ctc by @karpnv :: PR: #6656

TTS

Changelog

[TTS] FastPitch adapter fine-tune and conditional layer normalization by @hsiehjackson :: PR: #6416
[TTS] whitelist broken path fix. by @XuesongYang :: PR: #6412
[TTS] FastPitch speaker encoder by @hsiehjackson :: PR: #6417
Update NeMo_TTS_Primer.ipynb by @pythinker :: PR: #6436
[TTS] Create functions for TTS preprocessing without dataloader by @rlangman :: PR: #6317
[TTS] Fix FastPitch energy code by @rlangman :: PR: #6511
[TTS] Add script for computing feature stats by @rlangman :: PR: #6508
[TTS] Add tutorials for FastPitch TTS speaker adaptation with adapters by @hsiehjackson :: PR: #6431
[TTS] Create initial TTS dataset feature processors by @rlangman :: PR: #6507
[TTS] Add script for mapping speaker names to indices by @rlangman :: PR: #6509
[TTS] Implement new TextToSpeech dataset by @rlangman :: PR: #6575

NLP / NMT

Changelog

Add patches for Virtual Parallel conversion by @titu1994 :: PR: #6589
Update wfst_text_normalization.rst by @jimregan :: PR: #6374
add rampup batch size support for Megatron GPT by @dimapihtar :: PR: #6424
Add interleaved pp support by @titu1994 :: PR: #6498
Support dynamic length batches with GPT SFT by @aklife97 :: PR: #6510
Framework for PEFT via mixins by @arendu :: PR: #6391
Add GPT eval mode fix for interleaved to main (#6449) by @aklife97 :: PR: #6610
sft model can use this script for eval by @arendu :: PR: #6637
Patch memory used for NeMo Megatron models by @titu1994 :: PR: #6615
merge lora weights into base model by @arendu :: PR: #6597
Dialogue dataset by @yidong72 :: PR: #6654
check for first or last stage by @ericharper :: PR: #6708
A few small typo fixes by @Kipok :: PR: #6599
Lddl bert by @wdykas :: PR: #6761
Debug Transformer Engine FP8 support with Megatron-core infrastructure by @timmoon10 :: PR: #6740
Tensor-parallel communication overlap with userbuffer backend by @erhoo82 :: PR: #6780
Add ub communicator initialization to validation step by @erhoo82 :: PR: #6807
Add trainer.validate example for GPT by @ericharper :: PR: #6794
Add API docs for NeMo Megatron by @ericharper :: PR: #6850
Apply garbage collection interval to validation steps by @erhoo82 :: PR: #6870

Bugfixes

Changelog

[BugFix] Force _get_batch_preds() to keep logits in decoder timestamps generator by @tango4j :: PR: #6499
small bugfix for asr_evaluator by @fayejf :: PR: #6636
fix bucketing bug issue for picking new bucket by @nithinraok :: PR: #6663
[TTS] Fix TTS audio preprocessing bugs by @rlangman :: PR: #6628
Fix a bug, use _ceil_to_nearest instead as _round_to_nearest is not d… by @BestJuly :: PR: #6681
Bug fix to restore act ckpt by @markelsanz14 :: PR: #6753
Bug fix to reset sequence parallelism by @markelsanz14 :: PR: #6756
Bug fix for reset_sequence_parallel_args by @markelsanz14 :: PR: #6802
Fix adapter tutorial r1.19.0 by @hsiehjackson :: PR: #6776
Fix error appearing when using tar datasets by @Jorjeous :: PR: #6502
Fix normalization of impulse response in ImpulsePerturbation by @anteju :: PR: #6505
Fix typos by @titu1994 :: PR: #6523
Fix notebook bad json by @titu1994 :: PR: #6561
[ASR] Fix for old models in change_attention_model by @sam1373 :: PR: #6608
Fix k2 installation in Docker with CUDA 12 by @artbataev :: PR: #6707
Tutorial fixes by @titu1994 :: PR: #6717
Vp fixes by @titu1994 :: PR: #6738
[TTS] Fix aligner nan loss in fp32 by @hsiehjackson :: PR: #6435
fix conversion and eval by @arendu :: PR: #6648
Fix checkpointed forward and add test for full activation checkpointing by @aklife97 :: PR: #6744
add call to p2p overlap by @aklife97 :: PR: #6779
Fix get_parameters when using main params optimizer by @ericharper :: PR: #6764
Fix GPTDataset Assert by @MaximumEntropy :: PR: #6798
fix notebook error by @yidong72 :: PR: #6840
final fix of notebook by @yidong72 :: PR: #6842

General Improvements

Changelog

Code-Switching dataset creation - upgrading to aggregate tokenizer manifest format by @KunalDhawan :: PR: #6448
Fix an invalid link in get_data.py of ljspeech by @pythinker :: PR: #6456
Update manifest.py to use os.path for get_full_path by @stevehuang52 :: PR: #6598
Cherry pick commits in #6528 to main by @timmoon10 :: PR: #6613
Move black parameters to pyproject.toml by @artbataev :: PR: #6647
handle artifacts when path is an extracted dir by @arendu :: PR: #6658
remove upgrading setuptools in reinstall.sh by @XuesongYang :: PR: #6659
Upgrade to PyTorch 23.04 Container by @ericharper :: PR: #6660
Fix fastpitch test nightly by @hsiehjackson :: PR: #6742
Fix Links for tutorials by @titu1994 :: PR: #6777
Update core version in Jenkinsfile by @aklife97 :: PR: #6817
Update mcore requirement to 0.2.0 by @ericharper :: PR: #6875

相关地址：原始地址下载(tar) 下载(zip)

1、 conf-ensembles-overview.png 130.95KB

查看：2023-06-16发行的版本