v0.1.0a0

Release date: 2023-11-23 21:05:58

Latest InternLM/lmdeploy release: v0.6.0a0 (2024-08-26 17:12:19)

What's Changed

Add extra_requires to reduce dependencies by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/580
TurboMind 2 by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/590
Support loading hf model directly by @irexyc in https://github.com/InternLM/lmdeploy/pull/685

Fix init of batch state by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/682
fix turbomind stream canceling by @grimoire in https://github.com/InternLM/lmdeploy/pull/686
[Fix] Fix load_checkpoint_in_model bug by @HIT-cwh in https://github.com/InternLM/lmdeploy/pull/690
Fix wrong eos_id and bos_id obtained through grpc api by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/644
Fix cache/output length calculation by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/738
[Fix] Skip empty batch by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/747

[Docs] Update Supported Matrix by @pppppM in https://github.com/InternLM/lmdeploy/pull/679
[Docs] Update KV8 Docs by @pppppM in https://github.com/InternLM/lmdeploy/pull/681
[Doc] Update restful api doc by @AllentDan in https://github.com/InternLM/lmdeploy/pull/662
Check-in user guide about turbomind config by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/680

Full Changelog: https://github.com/InternLM/lmdeploy/compare/v0.0.14...v0.1.0a0