v0.2.1dev20240102
Released: 2024-01-02 13:05:11
What's Changed
- fix(timeout): larger timeout by @JiaoPL in https://github.com/InternLM/InternLM/pull/495
- feat(doc): add GPU memory info for 7B & 20B models by @li126com in https://github.com/InternLM/InternLM/pull/507
- feat(model): add rope_base interface by @00INDEX in https://github.com/InternLM/InternLM/pull/512
- Feat(QA): Check loss when swapping micro_num and micro_bsz && Check grad norm by @li126com in https://github.com/InternLM/InternLM/pull/510
- Fix(QA): fix incorrect Python file name in main by @li126com in https://github.com/InternLM/InternLM/pull/514
- fix/feat: small fix and enhancement by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/515
- test(workflow): add workflow for loss test and change trigger event by @kkscilife in https://github.com/InternLM/InternLM/pull/513
- fix(ci): fix test model ckpt ci test by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/518
- test(workflow): add unit test case by @kkscilife in https://github.com/InternLM/InternLM/pull/524
- feat(storage): use multipart upload when using oss by @li126com in https://github.com/InternLM/InternLM/pull/520
- Fix (QA checkpoint): fix test_model_checkpoint singleton import by @li126com in https://github.com/InternLM/InternLM/pull/526
- fix(model): add IS_SEQUENCE_PARALLEL check for norm module by @yingtongxiong in https://github.com/InternLM/InternLM/pull/528
- feat(model): add output embedding tf32 option by @JiaoPL in https://github.com/InternLM/InternLM/pull/523
- feat(grad_norm): vocab grad norm profiling by @JiaoPL in https://github.com/InternLM/InternLM/pull/519
- fix(data): fix the unpack for type_ids when use_flash_attn=False by @yingtongxiong in https://github.com/InternLM/InternLM/pull/516
- fix(storage): unify the name of AK and SK by @li126com in https://github.com/InternLM/InternLM/pull/527
- fix(test): fix type_ids unpack bug by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/530
- feat(model): support llama model with checkpoint loading by @li126com in https://github.com/InternLM/InternLM/pull/532
- fix(metric): add metric dtype control by @Pryest in https://github.com/InternLM/InternLM/pull/533
- feat(ckpt): support auto resume in Volc and Ali by @li126com in https://github.com/InternLM/InternLM/pull/529
- fix(sequence_parallel): fix norm all-reduce in seq_parallel when not overlapping by @yingtongxiong in https://github.com/InternLM/InternLM/pull/534
- fix(pp): fix micro batch loading error for non-packed dataset by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/538
- fix(model): change model_type `LLAMA` to `LLAMA2` by @li126com in https://github.com/InternLM/InternLM/pull/539
- fix(moe): fix moe zero mode bug by @blankde in https://github.com/InternLM/InternLM/pull/548
- fix(grad_norm): token grad norm with tp by @JiaoPL in https://github.com/InternLM/InternLM/pull/547
- test(workflow): change into reserved by @kkscilife in https://github.com/InternLM/InternLM/pull/550
- fix(model): add ckpt_type constraint when loading ckpts by @li126com in https://github.com/InternLM/InternLM/pull/542
- feat(logger): add tensorboard key value buffer by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/549
- fix(metrics): remove redundant cuda memory in metric calculations by @SolenoidWGT in https://github.com/InternLM/InternLM/pull/557
- fix(lr_scheduler): fix when resuming lr_scheduler without loading optimizer by @gaoyang07 in https://github.com/InternLM/InternLM/pull/565
Full Changelog: https://github.com/InternLM/InternLM/compare/v0.2.1dev20231121...v0.2.1dev20240102