v0.9.0
Release date: 2024-02-22 16:03:44
Latest xorbitsai/inference release: v0.11.0 (2024-05-11 17:41:09)
What's new in 0.9.0 (2024-02-22)
These are the changes in inference v0.9.0.
New features
- FEAT: Refactor device related code and add initial Intel GPU support by @notsyncing in https://github.com/xorbitsai/inference/pull/968
- FEAT: Support gemma series model by @aresnow1 in https://github.com/xorbitsai/inference/pull/1024
Enhancements
- ENH: [UI] Supports replica when launching LLM models by @ChengjieLi28 in https://github.com/xorbitsai/inference/pull/1011
- ENH: [UI] Show cluster resource information by @ChengjieLi28 in https://github.com/xorbitsai/inference/pull/1015
Bug fixes
- BUG: fix chat completion error when indexing body.messages by @fffonion in https://github.com/xorbitsai/inference/pull/1008
- BUG: Fix cache sd 1.5 error by @codingl2k1 in https://github.com/xorbitsai/inference/pull/1013
- BUG: fix typo in modelscope llama-2-13b-chat-GGUF by @qinxuye in https://github.com/xorbitsai/inference/pull/1026
- BUG: Fix missing qwen 1.5 7b gguf by @codingl2k1 in https://github.com/xorbitsai/inference/pull/1027
Documentation
- DOC: Polish model operation command doc by @onesuper in https://github.com/xorbitsai/inference/pull/1000
- DOC: Fix note on secret_key generation and algorithm selection for OAuth2 by @ChengjieLi28 in https://github.com/xorbitsai/inference/pull/1012
New Contributors
- @fffonion made their first contribution in https://github.com/xorbitsai/inference/pull/1008
- @notsyncing made their first contribution in https://github.com/xorbitsai/inference/pull/968
Full Changelog: https://github.com/xorbitsai/inference/compare/v0.8.5...v0.9.0