core_v0.5.0
版本发布时间: 2024-03-23 00:44:29
NVIDIA/Megatron-LM最新发布版本:core_r0.8.0(2024-08-13 20:12:40)
Key Features and Enhancements
Megatron core documentation is now live!
Model Features
- MoE (Mixture of Experts)
- Support for Z-loss, Load balancing and Sinkhorn
- Layer and communications refactor
- Richer parallelism mappings and EP can be combined with other model parallel techniques for larger MoE variants, e.g. EP + TP + DP + SP + PP
- Token dropless architecture with Top-K routing
- Performance optimization with with GroupedGEMM when number of local experts is > 1
- Distributed checkpointing
- Interleaved rotary embedding
Datasets
- Masked WordPiece datasets for BERT and T5
- Raw and mock datasets
Parallelism
Performance
- Activation offloading to CPU
- Rope and Swiglu fusion
- Sliding window attention (via Transformer Engine)
General Improvements
- Timers