v0.14.4

microsoft/DeepSpeed

Released: 2024-06-22 01:33:01

Latest microsoft/DeepSpeed release: v0.15.1 (2024-09-05 09:30:51)

What's Changed

Update version.txt after 0.14.3 release by @mrwyattii in https://github.com/microsoft/DeepSpeed/pull/5651
[CPU] SHM based allreduce improvement for small message size by @delock in https://github.com/microsoft/DeepSpeed/pull/5571
_exec_forward_pass: place zeros(1) on the same device as the param by @nelyahu in https://github.com/microsoft/DeepSpeed/pull/5576
[XPU] adapt lazy_call func to different versions by @YizhouZ in https://github.com/microsoft/DeepSpeed/pull/5670
fix IDEX dependence in xpu accelerator by @Liangliang-Ma in https://github.com/microsoft/DeepSpeed/pull/5666
Remove compile wrapper to simplify access to model attributes by @tohtana in https://github.com/microsoft/DeepSpeed/pull/5581
Fix hpZ with zero element by @samadejacobs in https://github.com/microsoft/DeepSpeed/pull/5652
Fixing the reshape bug in sequence parallel alltoall, which corrupted all QKV data by @YJHMITWEB in https://github.com/microsoft/DeepSpeed/pull/5664
enable yuan autotp & add conv tp by @Yejing-Lai in https://github.com/microsoft/DeepSpeed/pull/5428
Fix latest pytorch '_get_socket_with_port' import error by @Yejing-Lai in https://github.com/microsoft/DeepSpeed/pull/5654
Fix numpy upgrade to 2.0.0 BUFSIZE import error by @Yejing-Lai in https://github.com/microsoft/DeepSpeed/pull/5680
Update BUFSIZE to come from autotuner's constants.py, not numpy by @loadams in https://github.com/microsoft/DeepSpeed/pull/5686
[XPU] support op builder from intel_extension_for_pytorch kernel path by @YizhouZ in https://github.com/microsoft/DeepSpeed/pull/5425

New Contributors

@YJHMITWEB made their first contribution in https://github.com/microsoft/DeepSpeed/pull/5664

Full Changelog: https://github.com/microsoft/DeepSpeed/compare/v0.14.3...v0.14.4
