Add OptimumEmbedder #137
Reopening, as the docs are still missing. Nor have we announced this on social media (to the best of my knowledge).
The docs are live: https://docs.haystack.deepset.ai/docs/optimumtextembedder, https://docs.haystack.deepset.ai/docs/optimumdocumentembedder. I believe only the social media announcement is left.
Is your feature request related to a problem? Please describe.
Hugging Face's Optimum library provides faster inference through ONNX and TensorRT. This can be used to create blazing-fast embedding components. The concepts used in Optimum also play well with some of the concepts we have in Haystack. For example:
Loading non-ONNX checkpoints requires a conversion step, which takes some time. We can perform that step in our warm_up function (https://huggingface.co/docs/optimum/onnxruntime/usage_guides/models#loading-a-vanilla-transformers-model).
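The deferred-conversion idea above could be sketched as a Haystack-style component whose warm_up performs the one-time export. This is only an illustrative sketch: the class name `OptimumTextEmbedder` and the `_export_to_onnx` helper are placeholders, not the actual implementation (with Optimum the conversion would be roughly `ORTModelForFeatureExtraction.from_pretrained(model_name, export=True)`).

```python
class OptimumTextEmbedder:
    """Sketch: an embedder that defers the costly ONNX conversion to warm_up()."""

    def __init__(self, model_name: str):
        self.model_name = model_name
        self._model = None  # nothing loaded or converted at construction time

    def warm_up(self) -> None:
        # One-time, expensive step: convert the vanilla checkpoint to ONNX.
        # In a real component this would call into Optimum, e.g.
        #   ORTModelForFeatureExtraction.from_pretrained(self.model_name, export=True)
        if self._model is None:
            self._model = self._export_to_onnx(self.model_name)

    def _export_to_onnx(self, model_name: str):
        # Placeholder for the real conversion; returns a stand-in object here.
        return f"onnx:{model_name}"

    def run(self, text: str) -> dict:
        if self._model is None:
            raise RuntimeError("Component not warmed up. Call warm_up() first.")
        # Stand-in "embedding" so the sketch is runnable without any model.
        return {"embedding": [float(len(text))]}
```

Keeping the conversion out of `__init__` means constructing a pipeline stays cheap, and the expensive export only happens once, when the pipeline is warmed up.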
Describe the solution you'd like
Describe alternatives you've considered
Additional context
https://colab.research.google.com/drive/10UAtpz26Gv2LtamT8j33LmI5UFQFwF4T?usp=sharing
https://github.com/huggingface/optimum-benchmark/tree/main/examples/fast-mteb