MyGit

v1.19.0

NVIDIA/NeMo

版本发布时间: 2023-06-16 07:46:05

NVIDIA/NeMo最新发布版本:r2.0.0rc1(2024-08-16 05:55:14)

Highlights

NeMo ASR

NeMo TTS

NeMo Megatron

Container

For additional information regarding NeMo containers, please visit: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/nemo

docker pull nvcr.io/nvidia/nemo:23.04

Detailed Changelogs

ASR

Changelog
  • Sharded manifests for tarred datasets by @bmwshop :: PR: #6395
  • Update script for ngram rnnt and hat beam search decoding by @andrusenkoau :: PR: #6370
  • Add disclaimer about dataset for ASR by @titu1994 :: PR: #6496
  • New noise_norm perturbation based on Riva work by @trias702 :: PR: #6445
  • Add Frame-VAD model and datasets by @stevehuang52 :: PR: #6441
  • removing unnecessary avoid_bfloat16_autocast_context by @bmwshop :: PR: #6481
  • FC models in menu by @bmwshop :: PR: #6473
  • Separate punctuation by whitespace by @karpnv :: PR: #6574
  • Cherry pick commits in #6601 to main by @fayejf :: PR: #6611
  • Offline and streaming inference support for hybrid model by @fayejf :: PR: #6570
  • Disable interctc tests by @Kipok :: PR: #6638
  • ASR-TTS Models: Support hybrid RNNT-CTC, improve docs. by @artbataev :: PR: #6620
  • Confidence ensembles implementation by @Kipok :: PR: #6614
  • Confidence ensembles: fix issues and add tuning functionality by @Kipok :: PR: #6657
  • Add support for RNNT/hybrid models to partial transcribe by @stevehuang52 :: PR: #6609
  • eval_beamsearch_ngram.py with hybrid ctc by @karpnv :: PR: #6656

TTS

Changelog
  • [TTS] FastPitch adapter fine-tune and conditional layer normalization by @hsiehjackson :: PR: #6416
  • [TTS] whitelist broken path fix. by @XuesongYang :: PR: #6412
  • [TTS] FastPitch speaker encoder by @hsiehjackson :: PR: #6417
  • Update NeMo_TTS_Primer.ipynb by @pythinker :: PR: #6436
  • [TTS] Create functions for TTS preprocessing without dataloader by @rlangman :: PR: #6317
  • [TTS] Fix FastPitch energy code by @rlangman :: PR: #6511
  • [TTS] Add script for computing feature stats by @rlangman :: PR: #6508
  • [TTS] Add tutorials for FastPitch TTS speaker adaptation with adapters by @hsiehjackson :: PR: #6431
  • [TTS] Create initial TTS dataset feature processors by @rlangman :: PR: #6507
  • [TTS] Add script for mapping speaker names to indices by @rlangman :: PR: #6509
  • [TTS] Implement new TextToSpeech dataset by @rlangman :: PR: #6575

NLP / NMT

Changelog
  • Add patches for Virtual Parallel conversion by @titu1994 :: PR: #6589
  • Update wfst_text_normalization.rst by @jimregan :: PR: #6374
  • add rampup batch size support for Megatron GPT by @dimapihtar :: PR: #6424
  • Add interleaved pp support by @titu1994 :: PR: #6498
  • Support dynamic length batches with GPT SFT by @aklife97 :: PR: #6510
  • Framework for PEFT via mixins by @arendu :: PR: #6391
  • Add GPT eval mode fix for interleaved to main (#6449) by @aklife97 :: PR: #6610
  • sft model can use this script for eval by @arendu :: PR: #6637
  • Patch memory used for NeMo Megatron models by @titu1994 :: PR: #6615
  • merge lora weights into base model by @arendu :: PR: #6597
  • Dialogue dataset by @yidong72 :: PR: #6654
  • check for first or last stage by @ericharper :: PR: #6708
  • A few small typo fixes by @Kipok :: PR: #6599
  • Lddl bert by @wdykas :: PR: #6761
  • Debug Transformer Engine FP8 support with Megatron-core infrastructure by @timmoon10 :: PR: #6740
  • Tensor-parallel communication overlap with userbuffer backend by @erhoo82 :: PR: #6780
  • Add ub communicator initialization to validation step by @erhoo82 :: PR: #6807
  • Add trainer.validate example for GPT by @ericharper :: PR: #6794
  • Add API docs for NeMo Megatron by @ericharper :: PR: #6850
  • Apply garbage collection interval to validation steps by @erhoo82 :: PR: #6870

Bugfixes

Changelog
  • [BugFix] Force _get_batch_preds() to keep logits in decoder timestamps generator by @tango4j :: PR: #6499
  • small bugfix for asr_evaluator by @fayejf :: PR: #6636
  • fix bucketing bug issue for picking new bucket by @nithinraok :: PR: #6663
  • [TTS] Fix TTS audio preprocessing bugs by @rlangman :: PR: #6628
  • Fix a bug, use _ceil_to_nearest instead as _round_to_nearest is not d… by @BestJuly :: PR: #6681
  • Bug fix to restore act ckpt by @markelsanz14 :: PR: #6753
  • Bug fix to reset sequence parallelism by @markelsanz14 :: PR: #6756
  • Bug fix for reset_sequence_parallel_args by @markelsanz14 :: PR: #6802
  • Fix adapter tutorial r1.19.0 by @hsiehjackson :: PR: #6776
  • Fix error appearing when using tar datasets by @Jorjeous :: PR: #6502
  • Fix normalization of impulse response in ImpulsePerturbation by @anteju :: PR: #6505
  • Fix typos by @titu1994 :: PR: #6523
  • Fix notebook bad json by @titu1994 :: PR: #6561
  • [ASR] Fix for old models in change_attention_model by @sam1373 :: PR: #6608
  • Fix k2 installation in Docker with CUDA 12 by @artbataev :: PR: #6707
  • Tutorial fixes by @titu1994 :: PR: #6717
  • Vp fixes by @titu1994 :: PR: #6738
  • [TTS] Fix aligner nan loss in fp32 by @hsiehjackson :: PR: #6435
  • fix conversion and eval by @arendu :: PR: #6648
  • Fix checkpointed forward and add test for full activation checkpointing by @aklife97 :: PR: #6744
  • add call to p2p overlap by @aklife97 :: PR: #6779
  • Fix get_parameters when using main params optimizer by @ericharper :: PR: #6764
  • Fix GPTDataset Assert by @MaximumEntropy :: PR: #6798
  • fix notebook error by @yidong72 :: PR: #6840
  • final fix of notebook by @yidong72 :: PR: #6842

General Improvements

Changelog
  • Code-Switching dataset creation - upgrading to aggregate tokenizer manifest format by @KunalDhawan :: PR: #6448
  • Fix an invalid link in get_data.py of ljspeech by @pythinker :: PR: #6456
  • Update manifest.py to use os.path for get_full_path by @stevehuang52 :: PR: #6598
  • Cherry pick commits in #6528 to main by @timmoon10 :: PR: #6613
  • Move black parameters to pyproject.toml by @artbataev :: PR: #6647
  • handle artifacts when path is an extracted dir by @arendu :: PR: #6658
  • remove upgrading setuptools in reinstall.sh by @XuesongYang :: PR: #6659
  • Upgrade to PyTorch 23.04 Container by @ericharper :: PR: #6660
  • Fix fastpitch test nightly by @hsiehjackson :: PR: #6742
  • Fix Links for tutorials by @titu1994 :: PR: #6777
  • Update core version in Jenkinsfile by @aklife97 :: PR: #6817
  • Update mcore requirement to 0.2.0 by @ericharper :: PR: #6875

相关地址:原始地址 下载(tar) 下载(zip)

1、 conf-ensembles-overview.png 130.95KB

查看:2023-06-16发行的版本