v0.2.1dev20230908
Release date: 2023-09-08 18:12:09
Latest InternLM/InternLM release: v0.2.1dev20240102 (2024-01-02 13:05:11)
Highlights
- fix a bug that could produce NaN values when the gradient allreduce is overlapped with the backward pass
- support timeout wrapper and runtime diagnosis
- support a Chinese version of the readthedocs documentation
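
For readers unfamiliar with the idea, a timeout wrapper bounds how long a call may block before the caller gets an error. The sketch below is a generic illustration only, not the implementation added in this release: the names `with_timeout` and `slow_step` are hypothetical, and it assumes a thread-based approach from the Python standard library.

```python
import functools
import time
from concurrent.futures import ThreadPoolExecutor
from concurrent.futures import TimeoutError as FutureTimeout


def with_timeout(seconds):
    """Hypothetical decorator: raise TimeoutError if the call exceeds `seconds`."""

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            # Run the call in a single worker thread so the caller can stop waiting.
            pool = ThreadPoolExecutor(max_workers=1)
            future = pool.submit(func, *args, **kwargs)
            try:
                return future.result(timeout=seconds)
            except FutureTimeout:
                raise TimeoutError(f"{func.__name__} exceeded {seconds}s") from None
            finally:
                # Do not block here; a timed-out call keeps running in the background.
                pool.shutdown(wait=False)

        return wrapper

    return decorator


@with_timeout(0.05)
def slow_step():
    # Stands in for a hung storage or communication call (illustrative only).
    time.sleep(0.5)
```

Calling `slow_step()` raises `TimeoutError` once the 0.05 s budget is exceeded. A thread-based wrapper cannot cancel the underlying work; it only stops the caller from waiting on it.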
What's Changed
🚀 Features
- feat(monitor): add light monitor by @JiaoPL in https://github.com/InternLM/InternLM/pull/275
- feat(utils): add timeout wrapper by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/286
- feat: add runtime diagnosis by @sunpengsdu in https://github.com/InternLM/InternLM/pull/297
💥 Improvements
- fix(storage): refactor and fix storage_manager api by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/281
- Feat/sync grad use async op by @sunpengsdu in https://github.com/InternLM/InternLM/pull/277
🐞 Bug fixes
- fix(doc/code-docs): autodoc shown error by @huangting4201 in https://github.com/InternLM/InternLM/pull/265
- fix(eval): no need to check length of valid_dl when using streaming dataset by @00INDEX in https://github.com/InternLM/InternLM/pull/274
- fix/broadcast should not in commu stream by @sunpengsdu in https://github.com/InternLM/InternLM/pull/276
- fix(model): set tensor parallel attribute for mlp by @yingtongxiong in https://github.com/InternLM/InternLM/pull/271
- feat(ckpt): checkpoint bug fixes and feature enhancements. by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/259
- fix(ckpt): fix checkpoint reload bug by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/282
- fix(core/context): use dummy mode to generate random numbers in model construction by @blankde in https://github.com/InternLM/InternLM/pull/266
- fix(monitor): add alert switch and refactor monitor config by @JiaoPL in https://github.com/InternLM/InternLM/pull/285
- fix: fix the bug to do bcast in a stream by @sunpengsdu in https://github.com/InternLM/InternLM/pull/294
📚 Documentation
- docs(*): add documentation and reST files for readthedocs by @zigzagcai in https://github.com/InternLM/InternLM/pull/272
- docs(doc/code-docs): support zh cn readthedocs by @huangting4201 in https://github.com/InternLM/InternLM/pull/289
- docs(fsdp): add training option for fsdp by @zaglc in https://github.com/InternLM/InternLM/pull/273
- docs(doc/code-docs): refine profiler docs by @zigzagcai in https://github.com/InternLM/InternLM/pull/295
🌐 Other
Known issues
New Contributors
- @JiaoPL made their first contribution in https://github.com/InternLM/InternLM/pull/275
- @blankde made their first contribution in https://github.com/InternLM/InternLM/pull/266
- @zigzagcai made their first contribution in https://github.com/InternLM/InternLM/pull/272
- @zaglc made their first contribution in https://github.com/InternLM/InternLM/pull/273
Full Changelog: https://github.com/InternLM/InternLM/compare/v0.2.1dev20230901...v0.2.1dev20230908