22.05
版本发布时间: 2022-05-14 05:52:11
NVIDIA/TensorRT最新发布版本:v10.3.0(2024-08-09 07:23:49)
Commit used by the 22.05 TensorRT NGC container.
Changelog
Added
- Disentangled attention plugin for DeBERTa
- DMHA (multiscaleDeformableAttnPlugin) plugin for DDETR
- Performance benchmarking mode to HuggingFace demo
Changed
- Updated base TensorRT version to 8.2.5.1
- Updated onnx-graphsurgeon v0.3.19 CHANGELOG
- fp16 support for pillarScatterPlugin
-
#1939 - Fixed path in quantization
classification_flow
- Fixed GPT2 onnx export failure due to 2G limitation
- Use axis0 as default for deconv in pytorch-quantization toolkit
- Updated onnx export script for CoordConvAC sample
- Install devtoolset-8 for updated g++ version in CentOS7 container
Removed
- Usage of deprecated TensorRT APIs in samples removed
-
quant_bert.py
module removed from pytorch-quantization