v0.1.0
版本发布时间: 2023-07-04 05:31:17
ray-project/ray-llm最新发布版本:v0.5.0(2024-01-19 04:38:00)
What's Changed
- Ray Serve-native continuous batching support through Hugging Face text-generation-inference models
- Fixed exceptions when frontend is deployed with non-default port
Note: This update breaks existing APIs and requires changes to model config YAMLs
Full Changelog: https://github.com/ray-project/aviary/compare/v0.0.3...v0.1.0