v1.0.0
Release date: 2024-02-24 00:43:19
Latest release of huggingface/text-embeddings-inference: v1.5.0 (2024-07-10 23:34:40)
Highlights
- Support for Nomic models
- Support for Flash Attention for Jina models
- Metal backend for M* users
- `/tokenize` route to directly access the internal TEI tokenizer
- `/embed_all` route to allow client level pooling
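
As a rough illustration of what client level pooling with `/embed_all` could look like, here is a minimal Python sketch. It rests on assumptions not stated in these notes: that a TEI server is listening at `http://localhost:8080`, that the route accepts a JSON body of the form `{"inputs": [...]}`, and that it returns one list of per-token vectors for each input text. The helper names `post_json`, `mean_pool`, and `embed_with_client_pooling` are hypothetical and not part of TEI.

```python
import json
import urllib.request

# Assumption: a TEI server is reachable at this address; adjust as needed.
BASE_URL = "http://localhost:8080"


def post_json(route, payload, base_url=BASE_URL):
    """POST a JSON payload to a route and return the decoded JSON body."""
    request = urllib.request.Request(
        base_url + route,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


def mean_pool(token_embeddings):
    """Average a list of per-token vectors into a single vector."""
    if not token_embeddings:
        raise ValueError("cannot pool an empty list of token embeddings")
    dim = len(token_embeddings[0])
    sums = [0.0] * dim
    for vector in token_embeddings:
        if len(vector) != dim:
            raise ValueError("token vectors have inconsistent dimensions")
        for i, value in enumerate(vector):
            sums[i] += value
    return [total / len(token_embeddings) for total in sums]


def embed_with_client_pooling(texts, base_url=BASE_URL):
    """Fetch unpooled token embeddings and mean-pool them on the client."""
    # Assumed response shape: one list of token vectors per input text.
    per_text_tokens = post_json("/embed_all", {"inputs": texts}, base_url)
    return [mean_pool(tokens) for tokens in per_text_tokens]
```

With a server running, `embed_with_client_pooling(["some text"])` would return one pooled vector per input. Mean pooling is only one option here; the point of receiving unpooled token vectors is that the client can substitute any pooling strategy it prefers.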
What's Changed
- fix: limit the number of buckets for prom metrics by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/114
- feat: support flash attention for Jina by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/119
- feat: add support for Metal by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/120
- fix: fix turing for Jina and limit concurrency in docker build by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/121
- fix(router): fix panics on partial_cmp and empty req.texts by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/138
- feat(router): add /tokenize route by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/139
- feat(backend): support classification for bert by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/155
- feat: add embed_raw route to get all embeddings without pooling by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/154
- added docs for `OTLP_ENDPOINT` around the defaults and format sent by @MarcusDunn in https://github.com/huggingface/text-embeddings-inference/pull/157
- fix: use mimalloc to solve memory "leak" by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/161
- fix: remove modif of tokenizer by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/163
- fix: add cors_allow_origin to cli by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/162
- fix: use st max_seq_length by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/167
- feat: support nomic models by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/166
New Contributors
- @MarcusDunn made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/157
Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v0.6.0...v1.0.0