v4.4.0

版本发布时间: 2024-09-09 17:21:54

OpenNMT/CTranslate2最新发布版本:v4.4.0(2024-09-09 17:21:54)

Removed: Flash Attention support in the Python package due to significant package size increase with minimal performance gain.
Note: Flash Attention remains supported in the C++ package with the WITH_FLASH_ATTN option.
Flash Attention may be re-added in the future if substantial improvements are made.