v0.1.0a1
版本发布时间: 2023-11-29 21:51:22
InternLM/lmdeploy最新发布版本:v0.6.0a0(2024-08-26 17:12:19)
What's Changed
💥 Improvements
- Set the default value of
max_context_token_num
1 by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/761 - add triton server test and workflow yml by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/760
- improvement(build): enable ninja and gold linker by @tpoisonooo in https://github.com/InternLM/lmdeploy/pull/767
- Report first-token-latency and token-latency percentiles by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/736
- convert model with hf repo_id by @irexyc in https://github.com/InternLM/lmdeploy/pull/774
🐞 Bug fixes
- [Fix] build docker image failed since
packaging
is missing by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/753 - [Fix] Rollback the data type of
input_ids
toTYPE_UINT32
in preprocessor's proto by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/758 - fix turbomind build on sm<80 by @grimoire in https://github.com/InternLM/lmdeploy/pull/754
- fix typo by @grimoire in https://github.com/InternLM/lmdeploy/pull/769
🌐 Other
- bump version to 0.1.0a1 by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/776
Full Changelog: https://github.com/InternLM/lmdeploy/compare/v0.1.0a0...v0.1.0a1