How to query CollectionStore and EmbeddingStore models directly in a clean way? #88

darahayes · 2024-07-11T16:23:26Z

Hello, thanks for this great project I've found it very useful. I have a use case right now where within one application I want to create and manage multiple collections as well as being able to fetch and return details about some collections, e.g. the name and the collection metadata - Essentially my use case is CRUD for collections.

Currently I don't really see any way to do that cleanly other than dropping down to raw SQL queries in my application. Would this be the recommended approach?

I see in the source code in vectorstores.py that there are "private"/unexposed SQLAlchemy models defined for CollectionStore and EmbeddingsStore. Having them exposed would make querying against the tables a lot easier, at least for my particular use case.

I can understand why you might want to keep them private - they might be subject to change and any user code that touches those models potentially breaks. But I think even when the models are not exposed, if there were changes that resulted in the database tables being different, this would still be a breaking change for a lot of apps anyways.

Is exposing those models something you might consider? Or would you recommend going with raw SQL? Would be more than happy to submit a PR. Thanks!

The text was updated successfully, but these errors were encountered:

eyurtsev · 2024-07-12T19:50:57Z

Hi @darahayes, there's no current way to do this.

This code needs to be refactored to support two things:

Add a control plane (IndexAdmin) that will do exactly what you need it to do.
Create different tables for the actual embeddings (e.g., to support different embedding dimensions)

Here's a stub at the abstraction that's needed: https://github.com/langchain-ai/langchain/pull/23990/files

This would also open up the pathway for being able to apply specific types of indices on the collections and do schema migration down the roads if necessary.

If you're interested in helping out, I can help provide some guidance if needed!

Sachin-Bhat · 2024-07-18T11:58:17Z

Hey @eyurtsev,

If more information is given I can take this up.

Cheers,
Sachin

eyurtsev added the help wanted Extra attention is needed label Jul 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to query CollectionStore and EmbeddingStore models directly in a clean way? #88

How to query CollectionStore and EmbeddingStore models directly in a clean way? #88

darahayes commented Jul 11, 2024

eyurtsev commented Jul 12, 2024 •

edited

Loading

Sachin-Bhat commented Jul 18, 2024

How to query CollectionStore and EmbeddingStore models directly in a clean way? #88

How to query CollectionStore and EmbeddingStore models directly in a clean way? #88

Comments

darahayes commented Jul 11, 2024

eyurtsev commented Jul 12, 2024 • edited Loading

Sachin-Bhat commented Jul 18, 2024

eyurtsev commented Jul 12, 2024 •

edited

Loading