Replies: 3 comments 6 replies
-
Hey @nuric, I am glad you are finding the project useful. You are correct that integrating vector databases could bloat the project. But one of the reasons we wanted to do this is to make it easy to stream the embeddings straight into the vector database as they are being made so that the embeddings don't live on the system memory. This could be useful for larger workloads where several files are very large. I am happy to consider other solutions to not have the embeddings live in memory. Let me know if you have some ideas. |
Beta Was this translation helpful? Give feedback.
-
It'd be good to integrate with an existing library that implements the wrappers mentioned for each Vector DBs API. I develop and maintain Vector-io which does this for 10 different vector DBs. The intent at the moment is to use it for data migrations and backups, but open to expanding that scope to allow it to be a common interface for all vector DBs. |
Beta Was this translation helpful? Give feedback.
-
Hi @nuric , We have also added vector database adapters, and we would love it if you could contribute Sema db here. https://github.com/StarlightSearch/EmbedAnything/tree/main/examples/adapters Thanks |
Beta Was this translation helpful? Give feedback.
-
Thank you for this great project! I really like how easy it is to get the embeddings 🚀
I saw in your road map you mention about integrating with vector databases. I think this repo would be better off not having such integrations because it bloats the codebase so much. There are many versions of vector databases and you might end up wrapping around wrappers for client libraries. It's not too difficult to just take the embeddings and call a function to put them in a vector DB.
I'm the author of SemaDB so I thought I give a perspective from a vector database.
Beta Was this translation helpful? Give feedback.
All reactions