v0.2.0
Release date: 2023-02-03 19:56:09
Latest release of huggingface/text-generation-inference: v2.3.1 (2024-10-03 21:01:49)
Features
- router: support token streaming using Server-Sent Events (SSE)
- router: support seeding
- server: support gpt-neox
- server: support santacoder
- server: support repetition penalty
- server: allow the server to use a local weight cache
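To illustrate what consuming a token stream over Server-Sent Events involves on the client side, here is a minimal sketch of an SSE `data:` parser. It is not taken from the project: the function name `iter_sse_data` and the JSON payload shape in the sample (`{"token": {"text": ...}}`) are assumptions made for illustration, and the actual event payload emitted by the router may differ. The parser handles only `data:` fields and comment lines, which is enough to show the framing.

```python
import json
from typing import Iterable, Iterator


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each complete Server-Sent Event.

    `lines` is any iterable of text lines, e.g. a decoded HTTP response body.
    """
    buffer = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buffer:
                yield "\n".join(buffer)
                buffer = []
        elif line.startswith(":"):
            # Comment line, often used as a keep-alive; ignore it.
            continue
        elif line.startswith("data:"):
            value = line[len("data:"):]
            # The SSE format strips one leading space after the colon.
            buffer.append(value[1:] if value.startswith(" ") else value)
    # An event not terminated by a blank line is incomplete and is dropped.


# Hypothetical payload shape, used only to exercise the parser.
sample = [
    'data: {"token": {"text": "Hello"}}\n',
    "\n",
    ": keep-alive\n",
    'data: {"token": {"text": " world"}}\n',
    "\n",
]
text = "".join(json.loads(p)["token"]["text"] for p in iter_sse_data(sample))
print(text)  # Hello world
```

In practice the same generator could be fed the line iterator of a streaming HTTP response, so tokens are handled as they arrive instead of after the full generation completes.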
Breaking changes
- router: refactor Token API
- router: modify /generate API to only return generated text
Misc
- router: use background task to manage request queue
- ci: docker build/push on update