v0.4.6
版本发布时间: 2024-07-24 23:35:08
Lightning-AI/litgpt最新发布版本:v0.4.11(2024-08-24 01:42:08)
What's Changed
- Change default top_k to 50 everywhere for consistency by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1592
- Fix kv-cache clearing in Python API and Serve by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1596
- dynamic KV Cache batching by @aniketmaurya in https://github.com/Lightning-AI/litgpt/pull/1600
- Remove non-used eos_id in Python API by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1594
- Add quantization test and revert lightning version by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1605
- Dynamically set kv-cache size in serve by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1602
- Update LitData version and restore previous LitData assertions in tests by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1609
- Gemma 2:
9b
and27b
versions by @Andrei-Aksionov in https://github.com/Lightning-AI/litgpt/pull/1545 - Update config hub table qlora sections by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1611
- max_returned_tokens -> max_new_tokens by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1612
- Add warning about pretrain preprocessing by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1618
- Print warning about unsupported repo_ids by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1617
- Restore capability to load alternative weights by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1620
- Enable unbalanced number of layers in sequential generation by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1623
- Llama 3.1 8B and 70B checkpoints by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1619
- Add Llama 3.1 405B config by @awaelchli in https://github.com/Lightning-AI/litgpt/pull/1622
- Bumb version to 0.4.6 for next release (Gemma 2 and Llama 3.1) by @rasbt in https://github.com/Lightning-AI/litgpt/pull/1626
Full Changelog: https://github.com/Lightning-AI/litgpt/compare/v0.4.5...v0.4.6