MyGit

0.6.1

Mozilla-Ocho/llamafile

版本发布时间: 2024-01-20 16:09:47

Mozilla-Ocho/llamafile最新发布版本:0.8.13(2024-08-19 01:22:48)

llamafile lets you distribute and run LLMs with a single file

[line drawing of llama animal head in front of slightly open manilla folder filled with files]

This release fixes a crash that can happen on Apple Metal GPUs.

Windows users will see better performance with tinyBLAS. Please note we still recommend installing the CUDA SDK (NVIDIA), or HIP/ROCm SDK (AMD) for maximum performance and accuracy if you're in their support vector.

This release also synchronizes with llama.cpp upstream (as of Jan 9th) along with other improvements.

Example llamafiles

Our llamafiles on Hugging Face are updated shortly after a release goes live.

Flagship models

Supreme models (highest-end consumer hardware)

Tiny models (small enough to use on raspberry pi)

Other models:

If you have a slow Internet connection and want to update your llamafiles without needing to redownload, then see the instructions here: https://github.com/Mozilla-Ocho/llamafile/issues/24#issuecomment-1836362558 You can also download llamafile-0.6.1 and simply say ./llamafile-0.6.1 -m old.llamafile to run your old weights.

相关地址:原始地址 下载(tar) 下载(zip)

1、 llamafile-0.6.1 30.18MB

2、 llamafile-0.6.1.zip 14.15MB

3、 zipalign-0.6.1 725.21KB

查看:2024-01-20发行的版本