v1.56.4

版本发布时间: 2024-12-29 12:56:59

BerriAI/litellm最新发布版本:v1.56.9(2025-01-04 10:46:47)

What's Changed

Update model_prices_and_context_window.json by @superpoussin22 in https://github.com/BerriAI/litellm/pull/7452
(Refactor) 🧹 - remove deprecated litellm server by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/7456
📖 Docs - Using LiteLLM with 1M rows in spend logs by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/7461
(Admin UI - 1) - added the model used either directly before or after the "Assistant" so that it's clear which model provided the given assistant output by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/7459
(Admin UI - 2) UI chat should render the output in markdown by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/7460
(Security fix) - Upgrade to fastapi==0.115.5 by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/7447
fix OR deepseek by @paul-gauthier in https://github.com/BerriAI/litellm/pull/7425
(Bug Fix) Add health check support for realtime models by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/7453
(Refactor) - Re use litellm.completion/litellm.embedding etc for health checks by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/7455
Litellm dev 12 28 2024 p3 by @krrishdholakia in https://github.com/BerriAI/litellm/pull/7464
Fireworks AI - document inlining support + model access groups for wildcard models by @krrishdholakia in https://github.com/BerriAI/litellm/pull/7458

Full Changelog: https://github.com/BerriAI/litellm/compare/v1.56.3...v1.56.4

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.56.4

Name	Status	Median Response Time (ms)	Average Response Time (ms)	Requests/s	Failures/s	Request Count	Failure Count	Min Response Time (ms)	Max Response Time (ms)
/chat/completions	Passed ✅	240.0	268.74238744669225	6.116896356155644	0.0	1829	0	214.29422199992132	1969.7571099999323
Aggregated	Passed ✅	240.0	268.74238744669225	6.116896356155644	0.0	1829	0	214.29422199992132	1969.7571099999323