-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add one object storage based document store #758
Comments
Hello @lalitpagaria! Now I'm not entirely sure, but I think we already had this discussion previously (I'll link the issue if I find it back) and we came to the conclusion that Object stores like S3 simply did not have the features required to implement an Haystack document store. The addition of s3 select might change things, but I imagine that's not the only feature that was missing, unfortunately. However, if I find we had no issue open on this topic yet, let's keep this one to track of the status of these Object stores in the future 🙂 |
@ZanSara and @lalitpagaria |
@tstadel agree with you on the performance aspect. But there are few use cases where s3 can be a good alternative -
There are few articles about using S3 as a database https://dev.to/aws-builders/using-aws-s3-as-a-database-17l0 https://www.percona.com/blog/querying-archived-rds-data-directly-from-an-s3-bucket/ |
Currently, Haystack supports storing data into ElasticSearch, InMemory, and RDBMS.
It would be nice to add support of Object storage like S3, which is very cheap and have less hassle to maintain.
In the first step, AWS s3 can be supported as they recently added s3 select option which can help retrieve only a subset of data from an object (currently support CSV file object in compressed or uncompressed format).
Ideally, we can add a Metadata service as well which may help to use Haystack along with Data Lakes.
The text was updated successfully, but these errors were encountered: