v0.3.0

版本发布时间: 2024-01-23 00:48:50

NVIDIA/GenerativeAIExamples最新发布版本:v0.8.0(2024-08-21 11:11:58)

This release adds support for PGvector Vector DB, speech-in speech-out support using RIVA and RAG observability tooling. This release also adds a dedicated example for RAG pipeline using only models from NVIDIA AI Foundation and one example demonstrating query decomposition. Detailed changes are listed below:

Added

New dedicated example showcasing Nvidia AI Playground based models using Langchain connectors.
New example demonstrating query decomposition.
Support for using PG Vector as a vector database in the developer rag canonical example.
Support for using Speech-in Speech-out interface in the sample frontend leveraging RIVA Skills.
New tool showcasing RAG observability support.
Support for on-prem deployment of TRTLLM based nemotron models.

Changed

Upgraded Langchain and llamaindex dependencies for all container.
Restructured README files for better intuitiveness.
Added provision to plug in multiple examples using a common base class.
Changed minio service's port to 9010from 9000 in docker based deployment.
Moved evaluation directory from top level to under tools and created a dedicated compose file.
Added an experimental directory for plugging in experimental features.
Modified notebooks to use TRTLLM and Nvidia AI foundation based connectors from langchain.
Changed ai-playground model engine name to nv-ai-foundation in configurations.

Fixed

Fixed issue #19

相关地址：原始地址下载(tar) 下载(zip)

查看：2024-01-23发行的版本