v1.44.1
版本发布时间: 2024-08-21 22:58:16
BerriAI/litellm最新发布版本:v1.54.0(2024-12-08 12:50:00)
Guardrails on LiteLLM Proxy are now Free 🔥
Start here: https://docs.litellm.ai/docs/proxy/guardrails/quick_start
What's Changed
- Allow not displaying feedback box by @msabramo in https://github.com/BerriAI/litellm/pull/4868
- Fix app_version in helm build by @mikstew in https://github.com/BerriAI/litellm/pull/4649
- feat(azure.py): support 'json_schema' for older models by @krrishdholakia in https://github.com/BerriAI/litellm/pull/5296
- fix(cost_calculator.py): only override base model if custom pricing is set by @krrishdholakia in https://github.com/BerriAI/litellm/pull/5287
- feat(azure.py): support dynamic azure api versions by @krrishdholakia in https://github.com/BerriAI/litellm/pull/5284
- Fix helm chart job by @mikstew in https://github.com/BerriAI/litellm/pull/5297
- [Fix Router] - Don't cooldown Default Provider deployment by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/5302
- [Fix] Router - Do not retry on 404 errors from LLM API providers by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/5298
- [Fix Router] Don't retry errors when healthy_deployments=0 by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/5299
- [Fix] Router - don't recursively use the same fallback by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/5301
- [Fix Docker] Maintain separate docker image for running as non-root user by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/5306
- [Feat-Proxy] Make Guardrails Free / OSS - Lakera AI, Aporia AI 🛡️ by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/5303
- [Docs] - v2 Guardrails are now Free / Open Source by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/5309
New Contributors
- @mikstew made their first contribution in https://github.com/BerriAI/litellm/pull/4649
Full Changelog: https://github.com/BerriAI/litellm/compare/v1.43.19...v1.44.1
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.44.1
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 150.0 | 224.93742085437768 | 6.223856027540356 | 0.0 | 1861 | 0 | 116.26619000003302 | 15052.881646000003 |
Aggregated | Passed ✅ | 150.0 | 224.93742085437768 | 6.223856027540356 | 0.0 | 1861 | 0 | 116.26619000003302 | 15052.881646000003 |
1、 load_test.html 1.59MB
2、 load_test_stats.csv 546B