-
Notifications
You must be signed in to change notification settings - Fork 660
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Core feature] use of "url_to_fs" from fsspec.core in FileAccessProvider #3945
Comments
Thank you for opening your first issue here! 🛠 |
This is related to #3942. The PR there is attempting to add Azure specific args to the data handling portion of the
fsspec already supports default arguments to file systems though - https://filesystem-spec.readthedocs.io/en/latest/features.html#configuration. Is this something we can use @devictr? |
My idea would be to remove |
The
I think that could work. That way if we specify our preferred way of creating a filesystem using a config file or env variables, we can leave the kwargs to the user so they can override the default behaviour.
@timheb what do you mean by this? It seems like |
@devictr |
@timheb - I'm not quite sure I follow. I agree that the current But all this is kinda orthogonal to the issue at hand because you're creating your own file system right? you're trying to get the flytekit data persistence layer to trigger different permissions based on the path. But there's no guarantee that Wrt the things we can do to improve the current data_persistence::get_filesystem, we can
For background, the reason we need the anonymous handling is because s3 (and i think gcs as well) will fail to fetch a public object if you try to fetch it with credentials. weird, but that's how it is. because flytekit accesses arbitrary objects, it tries to pull anonymously if the non-anonymous way fails. |
Hello 👋, this issue has been inactive for over 9 months. To help maintain a clean and focused backlog, we'll be marking this issue as stale and will engage on it to decide if it is still applicable. |
Motivation: Why do you think this is important?
Currently it is difficult to use a custom fsspec protocol in flytekit. You can only implement configurations for s3 or gs. With
url_to_fs
you could implement dynamic configurations in your own protocol which can also be used as raw_output_prefix.Goal: What should the final outcome look like, ideally?
You can define your own fsspec protocols which are based on existing ones. Special credencial handling can be done in
_get_kwargs_from_urls
. Also, the protocol can be used as raw_output_prefixexample:
Describe alternatives you've considered
Do not know any alternative
Propose: Link/Inline OR Additional context
Could already be considered here: flyteorg/flytekit#1775
Are you sure this issue hasn't been raised already?
Have you read the Code of Conduct?
The text was updated successfully, but these errors were encountered: