v0.4.1
Release date: 2024-05-07 16:20:47
Latest InternLM/lmdeploy release: v0.6.0a0 (2024-08-26 17:12:19)
What's Changed
🚀 Features
- Add colab demo by @AllentDan in https://github.com/InternLM/lmdeploy/pull/1428
- support starcoder2 by @grimoire in https://github.com/InternLM/lmdeploy/pull/1468
- support OpenGVLab/InternVL-Chat-V1-5 by @irexyc in https://github.com/InternLM/lmdeploy/pull/1490
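PR #1490 adds OpenGVLab/InternVL-Chat-V1-5 to the supported vision-language models. Below is a minimal usage sketch through lmdeploy's `pipeline` API; the image URL is a placeholder and the exact prompt format may vary between versions, so treat this as an illustration rather than the canonical recipe.

```python
from lmdeploy import pipeline
from lmdeploy.vl import load_image

# Build a pipeline for the newly supported vision-language model (PR #1490).
pipe = pipeline('OpenGVLab/InternVL-Chat-V1-5')

# Load an example image; replace the URL with any accessible image.
image = load_image('https://example.com/sample.jpg')

# Query the model with a (text, image) pair and print the generated reply.
response = pipe(('describe this image', image))
print(response.text)
```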
💥 Improvements
- variable `CTA_H` & fix qkv bias by @lzhangzz in https://github.com/InternLM/lmdeploy/pull/1491
- refactor vision model loading by @irexyc in https://github.com/InternLM/lmdeploy/pull/1482
- fix installation requirements for windows by @irexyc in https://github.com/InternLM/lmdeploy/pull/1531
- Remove split batch inside pipeline inference function by @AllentDan in https://github.com/InternLM/lmdeploy/pull/1507
- Remove first empty chunk for api_server by @AllentDan in https://github.com/InternLM/lmdeploy/pull/1527
- add benchmark script to profile pipeline APIs by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/1528
- Add input validation by @AllentDan in https://github.com/InternLM/lmdeploy/pull/1525
🐞 Bug fixes
- fix local variable 'response' referenced before assignment in async_engine.generate by @irexyc in https://github.com/InternLM/lmdeploy/pull/1513
- Fix turbomind import in windows by @irexyc in https://github.com/InternLM/lmdeploy/pull/1533
- Fix convert qwen2 to turbomind by @AllentDan in https://github.com/InternLM/lmdeploy/pull/1546
- Adding api_key and model_name parameters to the restful benchmark by @NiuBlibing in https://github.com/InternLM/lmdeploy/pull/1478
📚 Documentations
- update supported models for Baichuan by @zhyncs in https://github.com/InternLM/lmdeploy/pull/1485
- Fix typo in w8a8.md by @Infinity4B in https://github.com/InternLM/lmdeploy/pull/1523
- complete build.md by @YanxingLiu in https://github.com/InternLM/lmdeploy/pull/1508
- update readme wechat qrcode by @vansin in https://github.com/InternLM/lmdeploy/pull/1529
- Update docker docs for VL api by @vody-am in https://github.com/InternLM/lmdeploy/pull/1534
- Format supported model table using html syntax by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/1493
- doc: add example of deploying api server to Kubernetes by @uzuku in https://github.com/InternLM/lmdeploy/pull/1488
🌐 Other
- add modelscope and lora testcase by @zhulinJulia24 in https://github.com/InternLM/lmdeploy/pull/1506
- bump version to v0.4.1 by @lvhan028 in https://github.com/InternLM/lmdeploy/pull/1544
New Contributors
- @NiuBlibing made their first contribution in https://github.com/InternLM/lmdeploy/pull/1478
- @Infinity4B made their first contribution in https://github.com/InternLM/lmdeploy/pull/1523
- @YanxingLiu made their first contribution in https://github.com/InternLM/lmdeploy/pull/1508
- @vody-am made their first contribution in https://github.com/InternLM/lmdeploy/pull/1534
- @uzuku made their first contribution in https://github.com/InternLM/lmdeploy/pull/1488
Full Changelog: https://github.com/InternLM/lmdeploy/compare/v0.4.0...v0.4.1
- lmdeploy-0.4.1+cu118-cp310-cp310-manylinux2014_x86_64.whl (69.78 MB)
- lmdeploy-0.4.1+cu118-cp310-cp310-win_amd64.whl (48.01 MB)
- lmdeploy-0.4.1+cu118-cp311-cp311-manylinux2014_x86_64.whl (69.8 MB)
- lmdeploy-0.4.1+cu118-cp311-cp311-win_amd64.whl (48.01 MB)
- lmdeploy-0.4.1+cu118-cp38-cp38-manylinux2014_x86_64.whl (69.8 MB)
- lmdeploy-0.4.1+cu118-cp38-cp38-win_amd64.whl (48.01 MB)
- lmdeploy-0.4.1+cu118-cp39-cp39-manylinux2014_x86_64.whl (69.79 MB)