v0.4.1
版本发布时间: 2023-03-26 22:38:21
huggingface/text-generation-inference最新发布版本:v2.3.1(2024-10-03 21:01:49)
Features
- server: New faster GPTNeoX implementation based on flash attention
Fix
- server: fix input-length discrepancy between Rust and Python tokenizers