v0.1.0a0
版本发布时间: 2023-11-23 21:05:58
InternLM/lmdeploy最新发布版本:v0.6.0a0(2024-08-26 17:12:19)
What's Changed
🚀 Features
- Add extra_requires to reduce dependencies by @RunningLeon in https://github.com/InternLM/lmdeploy/pull/580
- TurboMind 2 by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/590
- Support loading hf model directly by @irexyc in https://github.com/InternLM/lmdeploy/pull/685
💥 Improvements
- Fix Tokenizer encode by @AllentDan in https://github.com/InternLM/lmdeploy/pull/645
- Optimize for throughput by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/701
- Replace mmengine with mmengine-lite by @zhouzaida in https://github.com/InternLM/lmdeploy/pull/715
🐞 Bug fixes
- Fix init of batch state by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/682
- fix turbomind stream canceling by @grimoire in https://github.com/InternLM/lmdeploy/pull/686
- [Fix] Fix load_checkpoint_in_model bug by @HIT-cwh in https://github.com/InternLM/lmdeploy/pull/690
- Fix wrong eos_id and bos_id obtained through grpc api by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/644
- Fix cache/output length calculation by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/738
- [Fix] Skip empty batch by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/747
📚 Documentations
- [Docs] Update Supported Matrix by @pppppM in https://github.com/InternLM/lmdeploy/pull/679
- [Docs] Update KV8 Docs by @pppppM in https://github.com/InternLM/lmdeploy/pull/681
- [Doc] Update restful api doc by @AllentDan in https://github.com/InternLM/lmdeploy/pull/662
- Check-in user guide about turbomind config by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/680
🌐 Other
- bump version to v0.1.0a0 by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/709
New Contributors
- @zhouzaida made their first contribution in https://github.com/InternLM/lmdeploy/pull/715
Full Changelog: https://github.com/InternLM/lmdeploy/compare/v0.0.14...v0.1.0a0