v0.5.0
版本发布时间: 2024-01-10 10:32:32
tatsu-lab/alpaca_eval最新发布版本:v0.6.2(2024-04-19 14:28:02)
What's Changed
- Fix mssg check by @Muennighoff in https://github.com/tatsu-lab/alpaca_eval/pull/174
- Add MiniChat-1.5-3B to AlpacaEval and Fix MiniChat-3B by @GeneZC in https://github.com/tatsu-lab/alpaca_eval/pull/176
- Add 01-ai/Yi-34B-Chat to AlpacaEval by @HyperdriveHustle in https://github.com/tatsu-lab/alpaca_eval/pull/175
- feat: add way to verify results by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/177
- show img in readme by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/178
- Add PairRM best-of-16 to AlpacaEval by @jdf-prog in https://github.com/tatsu-lab/alpaca_eval/pull/181
- Verify Yi by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/182
- chore: add phi-2 sft by @lxuechen in https://github.com/tatsu-lab/alpaca_eval/pull/184
- add cut-13b by @wwxu21 in https://github.com/tatsu-lab/alpaca_eval/pull/186
- chore: add phi-2 dpo by @lxuechen in https://github.com/tatsu-lab/alpaca_eval/pull/185
- Support phi2, Support SOLAR 10.7B LMCocktail by @yhyu13 in https://github.com/tatsu-lab/alpaca_eval/pull/183
- Update openai.py by @Muennighoff in https://github.com/tatsu-lab/alpaca_eval/pull/188
- chore: add link for phi-2-sft by @lxuechen in https://github.com/tatsu-lab/alpaca_eval/pull/190
- chore: fix links by @lxuechen in https://github.com/tatsu-lab/alpaca_eval/pull/191
- Add deita-7b-v1.0 model by @VPeterV in https://github.com/tatsu-lab/alpaca_eval/pull/192
- [ENH] Azure OAI client & more general way of switching between client configs by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/193
- [ENH] Weighted win rates by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/189
- [ENH] new models: Gemini / claude2.1 / mistral / mixtral / .. by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/195
- [ENH] alpaca_eval 2.0 by @YannDubs in https://github.com/tatsu-lab/alpaca_eval/pull/196
New Contributors
- @Muennighoff made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/174
- @HyperdriveHustle made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/175
- @jdf-prog made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/181
- @lxuechen made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/184
- @wwxu21 made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/186
- @yhyu13 made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/183
- @VPeterV made their first contribution in https://github.com/tatsu-lab/alpaca_eval/pull/192
Full Changelog: https://github.com/tatsu-lab/alpaca_eval/compare/v0.3.6...v0.5.0