v1.4.2
Release date: 2024-02-21 21:52:00
Latest release of huggingface/text-generation-inference: v2.3.1 (2024-10-03 21:01:49)
Highlights
- Add support for Google Gemma models
What's Changed
- Fix mistral with length > window_size for long prefills (rotary doesn't create long enough cos, sin). by @Narsil in https://github.com/huggingface/text-generation-inference/pull/1571
- improve endpoint support by @drbh in https://github.com/huggingface/text-generation-inference/pull/1577
- refactor syntax to correctly include structs by @drbh in https://github.com/huggingface/text-generation-inference/pull/1580
- fix openapi and add jsonschema validation by @OlivierDehaene in https://github.com/huggingface/text-generation-inference/pull/1578
- add support for Gemma by @OlivierDehaene in https://github.com/huggingface/text-generation-inference/pull/1583
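The first item above describes a failure mode where the rotary cos/sin tables were too short for a prefill longer than `window_size`. The sketch below only illustrates that failure mode. It is not the code from the linked PR, and `rotary_tables`, `window_size = 8`, `prefill_length = 12`, and `head_dim = 4` are hypothetical names and values chosen for the example.

```python
import math


def rotary_tables(num_positions, head_dim, base=10000.0):
    """Build rotary cos/sin tables with one row per position (illustrative helper)."""
    # One inverse frequency per pair of head dimensions.
    inv_freq = [base ** (-2.0 * i / head_dim) for i in range(head_dim // 2)]
    cos = [[math.cos(p * f) for f in inv_freq] for p in range(num_positions)]
    sin = [[math.sin(p * f) for f in inv_freq] for p in range(num_positions)]
    return cos, sin


window_size = 8      # hypothetical sliding-window size
prefill_length = 12  # hypothetical prompt that is longer than the window

# Tables sized by the window only cover positions 0..window_size-1.
short_cos, short_sin = rotary_tables(window_size, head_dim=4)

# Tables sized by the longest sequence cover every prefill position.
full_cos, full_sin = rotary_tables(max(window_size, prefill_length), head_dim=4)

# Prefill positions that have no row in the short tables.
missing = [p for p in range(prefill_length) if p >= len(short_cos)]
print("positions without cos/sin rows:", missing)
```

Sizing the tables by the longest sequence in the batch rather than by the attention window is one way to avoid the gap. Whether the PR does exactly this is an assumption.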
Full Changelog: https://github.com/huggingface/text-generation-inference/compare/v1.4.1...v1.4.2