v1.41.8
Release date: 2024-07-06 12:08:09
🔥 Excited to launch support for logging LLM I/O to 🔭 Galileo through the LiteLLM (YC W23) Proxy (config sketch below): https://docs.litellm.ai/docs/proxy/logging#logging-llm-io-to-galielo
📈 [docs] New example Grafana Dashboards https://github.com/BerriAI/litellm/tree/main/cookbook/litellm_proxy_server/grafana_dashboard
🛡️ feat - control guardrails per API key (key-generation example below): https://docs.litellm.ai/docs/proxy/guardrails#switch-guardrails-onoff-per-api-key
🛠️ fix - raise and report Anthropic streaming errors (thanks David Manouchehri)
✨ [Fix] Add nvidia nim param mapping based on the model passed (example request below)
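
Below is a minimal sketch of wiring the proxy up to Galileo. The `success_callback` name and the `GALILEO_*` environment variables follow the docs page linked above; the concrete values are placeholders, not verified settings.

```shell
# Assumed per the Galileo logging docs linked above: callback name "galileo"
# and GALILEO_* env vars; all values below are placeholders
export GALILEO_BASE_URL="https://api.galileo.example"
export GALILEO_PROJECT_ID="my-project"
export GALILEO_USERNAME="you@example.com"
export GALILEO_PASSWORD="your-password"

# Write a proxy config that enables the callback, then start the proxy
cat > galileo_config.yaml <<'EOF'
model_list:
  - model_name: gpt-3.5-turbo
    litellm_params:
      model: gpt-3.5-turbo
litellm_settings:
  success_callback: ["galileo"]
EOF

litellm --config galileo_config.yaml
```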
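
And a hedged sketch of toggling a guardrail off for a single key via the proxy's `/key/generate` endpoint; the `permissions` payload shape and the `prompt_injection` guardrail name are illustrative assumptions, so check the guardrails doc linked above for the exact fields.

```shell
# /key/generate is the proxy's key-creation endpoint; the "permissions"
# payload shape and guardrail name here are assumptions for illustration
curl -X POST 'http://0.0.0.0:4000/key/generate' \
  -H 'Authorization: Bearer sk-1234' \
  -H 'Content-Type: application/json' \
  -d '{
        "permissions": {"prompt_injection": false}
      }'
```

Requests made with the returned key then skip that guardrail, while other keys keep it on.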
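
Finally, a sketch of the NVIDIA NIM param mapping in action through the proxy. It assumes a NIM model is reachable under the `nvidia_nim/` prefix (the model id below is illustrative); OpenAI-style params like `temperature` and `max_tokens` are mapped to whatever the specific model supports.

```shell
curl http://0.0.0.0:4000/v1/chat/completions \
  -H 'Authorization: Bearer sk-1234' \
  -H 'Content-Type: application/json' \
  -d '{
        "model": "nvidia_nim/meta/llama3-8b-instruct",
        "messages": [{"role": "user", "content": "Hello!"}],
        "temperature": 0.7,
        "max_tokens": 64
      }'
```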
What's Changed
- fix(anthropic.py): add index to streaming tool use by @igor-drozdov in https://github.com/BerriAI/litellm/pull/4554
- (fix) fixed bug with the watsonx embedding endpoint by @simonsanvil in https://github.com/BerriAI/litellm/pull/4540
- Revert "(fix) fixed bug with the watsonx embedding endpoint" by @krrishdholakia in https://github.com/BerriAI/litellm/pull/4561
- [docs] add example Grafana Dashboard by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/4563
- build(deps): bump certifi from 2023.7.22 to 2024.7.4 by @dependabot in https://github.com/BerriAI/litellm/pull/4568
- fix(proxy/utils.py): support logging rejected requests to langfuse, etc. by @krrishdholakia in https://github.com/BerriAI/litellm/pull/4564
- [Feat] Add Galileo Logging Callback by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/4567
- [Fix] Add nvidia nim param mapping based on model by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/4565
- fix - raise and report Anthropic streaming errors by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/4566
- feat - control guardrails per api key by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/4569
New Contributors
- @igor-drozdov made their first contribution in https://github.com/BerriAI/litellm/pull/4554
Full Changelog: https://github.com/BerriAI/litellm/compare/v1.41.7...v1.41.8
Docker Run LiteLLM Proxy
```shell
docker run \
  -e STORE_MODEL_IN_DB=True \
  -p 4000:4000 \
  ghcr.io/berriai/litellm:main-v1.41.8
```
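
Once the container is up, a quick sanity check (assuming default settings; the liveliness probe and OpenAI-compatible model list are standard proxy endpoints):

```shell
# probe the proxy, then list the models it exposes
curl http://0.0.0.0:4000/health/liveliness
curl http://0.0.0.0:4000/v1/models
```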
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| /chat/completions | Passed ✅ | 120.0 | 148.49 | 6.38 | 0.0 | 1909 | 0 | 109.11 | 1689.41 |
| Aggregated | Passed ✅ | 120.0 | 148.49 | 6.38 | 0.0 | 1909 | 0 | 109.11 | 1689.41 |
1. load_test.html (1.59 MB)
2. load_test_stats.csv (536 B)