
v0.10.0

NVIDIA/TensorRT-LLM

Release date: 2024-06-05 21:02:34

Latest NVIDIA/TensorRT-LLM release: v0.13.0 (2024-09-30 16:37:55)

Hi,

We are very pleased to announce the 0.10.0 version of TensorRT-LLM. It has been an intense effort, and we hope that it will enable you to easily deploy GPU-based inference for state-of-the-art LLMs. We want TensorRT-LLM to help you run those LLMs very fast.

This update includes:

Key Features and Enhancements

API Changes

Model Updates

Fixed Issues

Infrastructure Changes

Currently, there are two key branches in the project:

We are updating the main branch regularly with new features, bug fixes, and performance optimizations. The rel branch will be updated less frequently, and the exact frequency will depend on your feedback.

Thanks,
The TensorRT-LLM Engineering Team

