v0.4.2
版本发布时间: 2023-03-30 23:10:21
huggingface/text-generation-inference最新发布版本:v2.3.1(2024-10-03 21:01:49)
Features
- benchmark: tui based benchmarking tool
- router: Clear cache on error
- server: Add mypy-protobuf
- server: reduce mlp and attn in one op for flash neox
- image: aws sagemaker compatible image
Fix
- server: avoid try/except to determine the kind of AutoModel
- server: fix flash neox rotary embedding