v1.3.0
Release date: 2024-06-28 19:37:18
Latest release of huggingface/text-embeddings-inference: v1.5.0 (2024-07-10 23:34:40)
Notable changes
- New truncation direction parameter
- CUDA support for JinaCode model architecture
- CUDA support for Mistral model architecture
- CUDA support for Alibaba GTE model architecture
- New prompt name parameter: you can now add a prompt name to the body of your request to add a pre-prompt to your input, based on the Sentence Transformers configuration. You can also set a default prompt / prompt name to always add a pre-prompt to your requests.
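As a rough illustration of the two new request parameters, the sketch below builds a JSON request body that carries a prompt name and a truncation direction. The field names (`inputs`, `truncate`, `truncation_direction`, `prompt_name`), the value `"Left"`, and the prompt name `"query"` are assumptions made for this example and are not confirmed by these notes; check them against the server's API documentation and the model's Sentence Transformers configuration before use.

```python
import json


def build_embed_request(text, prompt_name=None, truncation_direction=None):
    """Build a JSON-serializable body for an embedding request.

    All field names here are assumptions for illustration only.
    """
    body = {"inputs": text}
    if prompt_name is not None:
        # Assumed field: selects a named pre-prompt from the model's
        # Sentence Transformers configuration.
        body["prompt_name"] = prompt_name
    if truncation_direction is not None:
        # Assumed fields: enable truncation and choose which side of an
        # over-long input is cut.
        body["truncate"] = True
        body["truncation_direction"] = truncation_direction
    return body


payload = build_embed_request(
    "What is deep learning?",
    prompt_name="query",           # hypothetical prompt name
    truncation_direction="Left",   # assumed accepted value
)
print(json.dumps(payload, indent=2))
```

The body would then be sent as JSON to the server's embedding endpoint. Optional fields are omitted when unset, so any default prompt configured on the server can still apply.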
What's Changed
- Ci migration to K8s by @glegendre01 in https://github.com/huggingface/text-embeddings-inference/pull/269
- chore: map compute_cap from GPU name by @haixiw in https://github.com/huggingface/text-embeddings-inference/pull/276
- chore: cover Nvidia T4/L4 GPU by @haixiw in https://github.com/huggingface/text-embeddings-inference/pull/284
- feat(ci): add trufflehog secrets detection by @McPatate in https://github.com/huggingface/text-embeddings-inference/pull/286
- Community contribution code of conduct by @LysandreJik in https://github.com/huggingface/text-embeddings-inference/pull/291
- Update README.md by @michaelfeil in https://github.com/huggingface/text-embeddings-inference/pull/277
- Upgrade tokenizers to 0.19.1 to deal with breaking change in tokenizers by @scriptator in https://github.com/huggingface/text-embeddings-inference/pull/266
- Add env for OTLP service name by @kozistr in https://github.com/huggingface/text-embeddings-inference/pull/285
- Fix CI build timeout by @fxmarty in https://github.com/huggingface/text-embeddings-inference/pull/296
- fix(router): payload limit was not correctly applied by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/298
- feat(candle): better cuda error by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/300
- feat(router): add truncation direction parameter by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/299
- Support for Jina Code model by @patricebechard in https://github.com/huggingface/text-embeddings-inference/pull/292
- feat(router): add base64 encoding_format for OpenAI API by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/301
- fix(candle): fix FlashJinaCodeModel by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/302
- fix: use malloc_trim to cleanup pages by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/307
- feat(candle): add FlashMistral by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/308
- feat(candle): add flash gte by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/310
- feat: add default prompts by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/312
- Add optional CORS allow any option value in http server cli by @kir-gadjello in https://github.com/huggingface/text-embeddings-inference/pull/260
- Update HUGGING_FACE_HUB_TOKEN to HF_API_TOKEN in README by @kevinhu in https://github.com/huggingface/text-embeddings-inference/pull/263
- v1.3.0 by @OlivierDehaene in https://github.com/huggingface/text-embeddings-inference/pull/313
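For the base64 encoding_format added to the OpenAI-compatible API in the list above, a client needs to turn the returned string back into floats. The sketch below assumes the payload is a packed array of little-endian 32-bit floats, which is the usual OpenAI convention; these notes do not state the byte layout, so treat it as an assumption. The helper names are hypothetical.

```python
import base64
import struct


def decode_base64_embedding(data: str) -> list[float]:
    """Decode a base64 string into floats.

    Assumes packed little-endian float32 values (not confirmed by the notes).
    """
    raw = base64.b64decode(data)
    if len(raw) % 4 != 0:
        raise ValueError("payload length is not a multiple of 4 bytes")
    count = len(raw) // 4
    return list(struct.unpack(f"<{count}f", raw))


def encode_base64_embedding(values: list[float]) -> str:
    """Inverse helper, used here only to produce a sample payload."""
    raw = struct.pack(f"<{len(values)}f", *values)
    return base64.b64encode(raw).decode("ascii")


# Round trip with values that are exactly representable as float32.
sample = encode_base64_embedding([0.5, -1.25, 2.0])
print(decode_base64_embedding(sample))
```

Base64 output is more compact on the wire than a JSON array of decimal floats, which is the main reason a client might request it.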
New Contributors
- @haixiw made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/276
- @McPatate made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/286
- @LysandreJik made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/291
- @michaelfeil made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/277
- @scriptator made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/266
- @fxmarty made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/296
- @patricebechard made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/292
- @kir-gadjello made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/260
- @kevinhu made their first contribution in https://github.com/huggingface/text-embeddings-inference/pull/263
Full Changelog: https://github.com/huggingface/text-embeddings-inference/compare/v1.2.3...v1.3.0