v0.5.3
版本发布时间: 2024-02-01 16:54:42
tatsu-lab/alpaca_eval最新发布版本:v0.6.5(2024-08-18 07:39:20)
What's Changed
- [ENH] add mistral-medium by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/205
- [ENH] add internlm2-chat-20b-ppo by @C1rN09 in https://github.com/tatsu-lab/alpaca_eval/pull/207
- prettify "pretty_name" of internlm2 by @C1rN09 in https://github.com/tatsu-lab/alpaca_eval/pull/208
- [ENH] add outputs & configs form dolphin 2.2.1 by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/209
- Add PairRM 0.4B + Yi-34B-Chat to AlpacaEval 2.0 by @jdf-prog in https://github.com/tatsu-lab/alpaca_eval/pull/210
- dolphin 2.1.1 configs.yaml by @gblazex in https://github.com/tatsu-lab/alpaca_eval/pull/212
- Update README.md (small typo) by @xwinxu in https://github.com/tatsu-lab/alpaca_eval/pull/213
- [TEST]: fix ordering of df by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/214
- Add Snorkel-Mistral-PairRM-DPO (best-of-16) to Alpaca Eval 2.0 by @viethoangtranduong in https://github.com/tatsu-lab/alpaca_eval/pull/215
- update InternLM2 chat template by @C1rN09 in https://github.com/tatsu-lab/alpaca_eval/pull/216
- Add Starling-LM-7B-alpha, vicuna-13b-v1.5, vicuna-7b-v1.5 to AlpacaEval (config + outputs without annotations) by @gblazex in https://github.com/tatsu-lab/alpaca_eval/pull/217
- [RES] add 3 models for arena correlations by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/218
- Add xwinlm-70b-v0.3 to AlpacaEval by @nbl97 in https://github.com/tatsu-lab/alpaca_eval/pull/221
- [ENH] add referenced_models locally by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/224
New Contributors
- @C1rN09 made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/207
- @gblazex made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/212
- @xwinxu made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/213
- @viethoangtranduong made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/215
Full Changelog: https://github.com/tatsu-lab/alpaca_eval/compare/v0.5.2...v0.5.3