v0.1.33-rc5
Release date: 2024-04-29 01:51:17
Latest ollama/ollama release: v0.1.38 (2024-05-15 08:28:00)
Models:
- Llama 3: A new model by Meta, and the most capable openly available LLM to date.
- Phi 3 Mini: A new 3.8B-parameter, lightweight, state-of-the-art open model by Microsoft.
- Moondream: A small vision language model designed to run efficiently on edge devices.
- Dolphin Llama 3: The uncensored Dolphin model, trained by Eric Hartford and based on Llama 3 with a variety of instruction, conversational, and coding skills.
- Qwen 110B: The first Qwen model over 100B parameters in size, with outstanding performance in evaluations.
What's Changed
- Fixed issues where the model would not terminate, causing the API to hang.
- Fixed a series of out-of-memory errors on Apple Silicon Macs.
- Fixed out-of-memory errors when running Mixtral architecture models.
Experimental concurrency features
New concurrency features are coming soon to Ollama. They are available experimentally through two environment variables:
- OLLAMA_NUM_PARALLEL: Handle multiple requests simultaneously for a single model
- OLLAMA_MAX_LOADED_MODELS: Load multiple models simultaneously
To enable these features, set the environment variables for ollama serve (the original release notes link to a guide with more information):
OLLAMA_NUM_PARALLEL=4 OLLAMA_MAX_LOADED_MODELS=4 ollama serve
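With the server started as above, a client can keep several requests in flight against one model. The following is a minimal sketch, not an official client: it assumes the server listens on the default local address http://localhost:11434, uses the /api/generate endpoint in non-streaming mode, and takes "llama3" as an example model name. The helper names post_generate and generate_many are hypothetical and exist only for this illustration.

```python
import json
import urllib.request
from concurrent.futures import ThreadPoolExecutor

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def post_generate(model, prompt, url=OLLAMA_URL, timeout=120):
    """Send one non-streaming generate request and return the response text."""
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    request = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as reply:
        return json.loads(reply.read())["response"]


def generate_many(model, prompts, max_workers=4, send=post_generate):
    """Issue one request per prompt concurrently; results keep the prompt order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(lambda prompt: send(model, prompt), prompts))


if __name__ == "__main__":
    # "llama3" is an example model name; it must already be pulled locally.
    prompts = ["Why is the sky blue?", "What is 2 + 2?"]
    for answer in generate_many("llama3", prompts):
        print(answer)
```

Setting max_workers to the same value as OLLAMA_NUM_PARALLEL is a reasonable starting point, since requests beyond what the server handles in parallel would simply wait. The send parameter is there so the concurrency logic can be exercised without a running server.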
New Contributors
- @sidxt made their first contribution in https://github.com/ollama/ollama/pull/3705
- @ChengenH made their first contribution in https://github.com/ollama/ollama/pull/3789
- @secondtruth made their first contribution in https://github.com/ollama/ollama/pull/3503
- @reid41 made their first contribution in https://github.com/ollama/ollama/pull/3612
- @ericcurtin made their first contribution in https://github.com/ollama/ollama/pull/3626
- @JT2M0L3Y made their first contribution in https://github.com/ollama/ollama/pull/3633
- @datvodinh made their first contribution in https://github.com/ollama/ollama/pull/3655
- @MapleEve made their first contribution in https://github.com/ollama/ollama/pull/3817
- @swuecho made their first contribution in https://github.com/ollama/ollama/pull/3810
- @brycereitano made their first contribution in https://github.com/ollama/ollama/pull/3895
- @bsdnet made their first contribution in https://github.com/ollama/ollama/pull/3889
- @fyxtro made their first contribution in https://github.com/ollama/ollama/pull/3855
- @natalyjazzviolin made their first contribution in https://github.com/ollama/ollama/pull/3962
Full Changelog: https://github.com/ollama/ollama/compare/v0.1.32...v0.1.33-rc5
1. ollama-darwin 49.63 MB
2. Ollama-darwin.zip 175.17 MB
3. ollama-linux-amd64 290.93 MB
4. ollama-linux-amd64-rocm.tgz 1.06 GB
5. ollama-linux-arm64 278.39 MB
6. ollama-windows-amd64.zip 427.05 MB
7. OllamaSetup.exe 197.83 MB
8. sha256sum.txt 601 B