v0.5.2
版本发布时间: 2024-07-26 16:07:49
InternLM/lmdeploy最新发布版本:v0.6.0a0(2024-08-26 17:12:19)
Highlight
- LMDeploy support Llama3.1 and its Tool Calling. An example of calling "Wolfram Alpha" to perform complex mathematical calculations can be found from here
What's Changed
🚀 Features
- Support glm4 awq by @AllentDan in https://github.com/InternLM/lmdeploy/pull/1993
- Support llama3.1 by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/2122
- Support Llama3.1 tool calling by @AllentDan in https://github.com/InternLM/lmdeploy/pull/2123
💥 Improvements
- Remove the triton inference server backend "turbomind_backend" by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/1986
- Remove kv cache offline quantization by @AllentDan in https://github.com/InternLM/lmdeploy/pull/2097
- Remove
session_len
and deprecated short names of the chat templates by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/2105 - clarify "n>1" in GenerationConfig hasn't been supported yet by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/2108
🐞 Bug fixes
- fix stop words for glm4 by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/2044
- Disable peer access code by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/2082
- set log level ERROR in benchmark scripts by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/2086
- raise thread exception by @irexyc in https://github.com/InternLM/lmdeploy/pull/2071
- Fix index error when profiling token generation with
-ct 1
by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/1898
🌐 Other
- misc: replace slow Jimver/cuda-toolkit by @zhyncs in https://github.com/InternLM/lmdeploy/pull/2065
- misc: update bug issue template by @zhyncs in https://github.com/InternLM/lmdeploy/pull/2083
- update daily testcase new by @zhulinJulia24 in https://github.com/InternLM/lmdeploy/pull/2035
- bump version to v0.5.2 by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/2143
Full Changelog: https://github.com/InternLM/lmdeploy/compare/v0.5.1...v0.5.2
1、 lmdeploy-0.5.2+cu118-cp310-cp310-manylinux2014_x86_64.whl 68.02MB
2、 lmdeploy-0.5.2+cu118-cp310-cp310-win_amd64.whl 45.19MB
3、 lmdeploy-0.5.2+cu118-cp311-cp311-manylinux2014_x86_64.whl 68.04MB
4、 lmdeploy-0.5.2+cu118-cp311-cp311-win_amd64.whl 45.19MB
5、 lmdeploy-0.5.2+cu118-cp312-cp312-manylinux2014_x86_64.whl 68.05MB
6、 lmdeploy-0.5.2+cu118-cp312-cp312-win_amd64.whl 45.19MB
7、 lmdeploy-0.5.2+cu118-cp38-cp38-manylinux2014_x86_64.whl 68.03MB
8、 lmdeploy-0.5.2+cu118-cp38-cp38-win_amd64.whl 45.19MB
9、 lmdeploy-0.5.2+cu118-cp39-cp39-manylinux2014_x86_64.whl 68.02MB
10、 lmdeploy-0.5.2+cu118-cp39-cp39-win_amd64.whl 45.18MB