v1.35.12
版本发布时间: 2024-04-18 12:33:15
BerriAI/litellm最新发布版本:v1.37.3(2024-05-11 11:16:52)
What's Changed
- fix(vertex_ai.py): fix faulty async call tool calling check by @krrishdholakia in https://github.com/BerriAI/litellm/pull/3102
- Support for Claude 3 Opus on vertex_ai by @Dev-Khant in https://github.com/BerriAI/litellm/pull/3026
- [FIX} Repeat Slack Alerts triggered for "User Crossed Budget" by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/3114
Full Changelog: https://github.com/BerriAI/litellm/compare/v1.35.11...v1.35.12
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 81 | 90.18032295762929 | 1.5764885124209485 | 0.0 | 472 | 0 | 75.0108780000005 | 1097.2126149999895 |
/health/liveliness | Passed ✅ | 66 | 68.85966111970596 | 15.457603465008791 | 0.0 | 4628 | 0 | 63.27401099997587 | 1008.0632059999743 |
/health/readiness | Passed ✅ | 66 | 69.48201319830875 | 15.006701030312122 | 0.003340018034790145 | 4493 | 1 | 63.539215999981025 | 1354.8842850000256 |
Aggregated | Passed ✅ | 66 | 70.20017819222355 | 32.04079300774186 | 0.003340018034790145 | 9593 | 1 | 63.27401099997587 | 1354.8842850000256 |
1、 load_test.html 1.59MB
2、 load_test_stats.csv 889B