Enable use of embeddings from Vision Language models #3452

jacopo-chevallard · 2024-11-04T15:56:27Z

Currently, we use text embeddings. This works well for textual documents, but it has obvious drawbacks for documents containing non-textual content (images, graphs, diagrams, …).

An alternative is to use Vision Language models such as ColPali (see also https://huggingface.co/blog/manu/colpali, https://danielvanstrien.xyz/posts/post-with-code/colpali-qdrant/2024-10-02_using_colpali_with_qdrant.html, https://blog.vespa.ai/retrieval-with-vision-language-models-colpali/, https://blog.vespa.ai/scaling-colpali-to-billions/).
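For context, ColPali-style models embed each page image as a set of patch vectors and score a query against a page with late interaction (MaxSim) rather than a single-vector similarity. Below is a minimal sketch of that scoring step only, assuming toy 2-D vectors in place of real model output; `maxsim_score`, `rank_pages`, and the page ids are hypothetical names used for illustration, not part of any library or of our codebase.

```python
from typing import Dict, List, Sequence, Tuple

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Plain dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_vecs: List[Vector], page_vecs: List[Vector]) -> float:
    """Late-interaction score: for every query token vector, keep the
    best-matching page patch vector, then sum those maxima."""
    return sum(max(dot(q, p) for p in page_vecs) for q in query_vecs)


def rank_pages(
    query_vecs: List[Vector], pages: Dict[str, List[Vector]]
) -> List[Tuple[str, float]]:
    """Score every page against the query and sort best-first."""
    scored = [(page_id, maxsim_score(query_vecs, vecs)) for page_id, vecs in pages.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy data (assumption): two query token vectors, two pages with two patches each.
query = [[1.0, 0.0], [0.0, 1.0]]
pages = {
    "chart_page": [[0.9, 0.1], [0.2, 0.8]],
    "text_page": [[0.5, 0.5], [0.4, 0.4]],
}
ranking = rank_pages(query, pages)
```

In a real setup the vectors would come from the model and the scoring would typically be delegated to a vector store that supports multi-vector documents, as the linked Qdrant and Vespa posts describe.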

linear · 2024-11-04T15:56:28Z

CORE-280 Enable use of embeddings from Vision Language models

jacopo-chevallard added the area: backend (Related to backend functionality or under the /backend directory), rag: ingestion, and rag: retrieval labels Nov 4, 2024, with Linear

jacopo-chevallard self-assigned this Nov 4, 2024

jacopo-chevallard changed the title ~~Enable use of embeddigns Vision Language models~~ Enable use of embeddings Vision Language models Nov 4, 2024

jacopo-chevallard changed the title ~~Enable use of embeddings Vision Language models~~ Enable use of embeddings from Vision Language models Nov 4, 2024

dosubot bot added the enhancement (New feature or request) label Nov 4, 2024
