v0.2.0
Released: 2023-02-03 19:56:09
Latest release of huggingface/text-generation-inference: v3.0.1 (2024-12-12 04:13:58)
Features
- router: support token streaming using Server-Sent Events
- router: support seeding
- server: support gpt-neox
- server: support santacoder
- server: support repetition penalty
- server: allow the server to use a local weight cache
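
To illustrate what consuming a token stream over Server-Sent Events can look like on the client side, here is a minimal sketch of a parser for the SSE wire format. It is not taken from this project: the helper name `iter_sse_data` and the JSON payload shape in the sample are assumptions made for illustration, and the actual event schema emitted by the router may differ.

```python
import json
from typing import Iterable, Iterator


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each Server-Sent Event found in a stream of text lines.

    Hypothetical helper: handles only `data:` fields and comment lines.
    """
    buffer = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buffer:
                yield "\n".join(buffer)
                buffer = []
        elif line.startswith(":"):
            # Comment line (often used as a keep-alive); ignore it.
            continue
        elif line.startswith("data:"):
            value = line[len("data:"):]
            # The SSE format drops one leading space after the colon.
            if value.startswith(" "):
                value = value[1:]
            buffer.append(value)
    # An event not terminated by a blank line is discarded, as the SSE format specifies.


# Sample stream; the {"token": {"text": ...}} shape is an assumed payload, not the documented one.
sample = [
    'data: {"token": {"text": "Hello"}}\n',
    "\n",
    ": keep-alive\n",
    'data: {"token": {"text": " world"}}\n',
    "\n",
]

text = "".join(json.loads(payload)["token"]["text"] for payload in iter_sse_data(sample))
print(text)  # prints "Hello world"
```

In practice the lines would come from a streaming HTTP response rather than a list, and the client would append each token as its event arrives instead of joining them at the end.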
Breaking changes
- router: refactor Token API
- router: modify /generate API to only return generated text
Misc
- router: use background task to manage request queue
- ci: docker build/push on update