Enhancement Request: List keys (files) in S3 bucket using paginators #9

adrianyorke · 2019-12-12T15:20:06Z

I would like to connect to an existing S3 bucket and list the keys (files). This will then drive further test cases on a key-by-key basis as we fetch files in order and perform data quality checks such as row counts, hash value, average or total of numerical columns, etc..

There is a good write-up here which explains how this could be implemented using paginators. Paginators simplify paging complexities for buckets that contain large amounts of files.

teaglebuilt · 2019-12-14T22:09:20Z

this is a good place to start

teaglebuilt · 2019-12-14T22:10:08Z

if know one has claimed this issue. I will take it on

adrianyorke · 2019-12-14T23:38:07Z

Go for it @teaglebuilt. I will review/test and merge your pull request if that method works for you?

adrianyorke · 2019-12-14T23:41:00Z

Regarding PRs - would you prefer that we create a branch for each patch or are you happy just to work on master for now? Bigger projects (like robot framework core) normally prefer a separate branch for each patch but smaller projects can just work on master for simple fixes and enhancements.

teaglebuilt · 2019-12-15T17:07:13Z

i think we should always create pull requests.setting master as the tracking branch works instead of creating a dev branch for now. Maybe this will change when we have caught up. As far as documentation is concerned, i dont want the wait of pul request to hold back errors or incorrect documentation. I definately do not want to work off master from this point forward. If you are a collaborator, then you should be able to change incorrect information / documentation without a pull request. Like all of the markdown documentation, keyword docs, and so on

teaglebuilt · 2019-12-15T17:08:01Z

I have used robots libdoc for auto documentation, pre commit for linting, and I will set up travis ci to deploy to pypi on tag releases

teaglebuilt · 2019-12-15T18:29:39Z

after starting playing around with paginators, i think we need to think about all the s3 keywords that we want to create using this. This issue keyword should list all by page, or by prefix?

preferred keyword name?
other keyword offsets? For example:

List Keys params: bucket, prefix ?

Or

List Keys Bucket
List Keys by Prefix Bucket Prefix

adrianyorke · 2019-12-16T22:15:48Z

List Keys is what I had in mind. Should match the boto function. One thing to consider is that there may be many 1000s of keys in a single bucket so filtering option would be useful.

teaglebuilt · 2019-12-17T16:34:31Z

alright

NeoMorfeo · 2020-04-29T11:29:56Z

Any update on this? required help to develop?

adrianyorke · 2020-04-29T17:57:20Z

@NeoMorfeo: @teaglebuilt commented 15 Dec: "if know one has claimed this issue. I will take it on".

NeoMorfeo · 2020-04-29T18:02:16Z

thanks @adrianyorke 😄, @teaglebuilt It will be nice to have, please ask me if required.

Also a good point to is to search/filter on the bucket, by prefix, as indicated in the other comment, for my has more sense to mimic the boto and have one Keyword with List Keys pagination, prefix and so.

Thanks in advance

adrianyorke · 2020-04-29T18:23:27Z

@NeoMorfeo: Contributions are most welcome and I am happy to test and review. First take a look at the Contributing guidelines: https://github.com/teaglebuilt/robotframework-aws/blob/master/CONTRIBUTING.md

Let's wait to hear from @teaglebuilt before you put too much effort into this - he may have the solution sitting some local branch so let's not waste time fixing it again until we've heard back from him.

NeoMorfeo · 2020-06-24T11:23:50Z

No news about this @teaglebuilt or @adrianyorke ? then maybe I will implement by myself and ask for a PR :D because i need to improve this as much as posible :=)

teaglebuilt · 2020-06-24T11:25:37Z

@NeoMorfeo what are you asking. You are free to contribute and if you submitted a pull request I’ll pull it down and test it. This repo needs your help

NeoMorfeo · 2020-06-25T06:10:57Z

@teaglebuilt no worries, just wondering if you guys spoke about this, nothing morry, sorry for bother :(

I will make a change on the code, and will ask for PR to check if fits to your standars. Thanks!

teaglebuilt · 2020-06-25T14:04:41Z

@NeoMorfeo great i am sure it will, the only thing is we need to write a unit test and robotframework test for each keyword or modify it based on the changes made. Under the test directory there should be a folder for unit tests and RF/acceptance tests.

NeoMorfeo · 2020-06-26T06:45:33Z

Ok @teaglebuilt also i will follow the Contributing guidelines as @adrianyorke sugest 😄

Now i need time to implement :)

teaglebuilt · 2020-07-09T18:41:32Z

@NeoMorfeo hows it coming? any roadblocks or issues?

NeoMorfeo · 2020-07-09T19:17:04Z

No no, sorry, I have busy times and no time to develop this, but I'm over it... Sorry for delaying

adrianyorke added enhancement New feature or request good first issue Good for newcomers labels Dec 12, 2019

adrianyorke changed the title ~~Enhancement Request: List keys in S3 bucket using paginators~~ Enhancement Request: List keys (files) in S3 bucket using paginators Dec 12, 2019

teaglebuilt self-assigned this Dec 15, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhancement Request: List keys (files) in S3 bucket using paginators #9

Enhancement Request: List keys (files) in S3 bucket using paginators #9

adrianyorke commented Dec 12, 2019

teaglebuilt commented Dec 14, 2019

teaglebuilt commented Dec 14, 2019

adrianyorke commented Dec 14, 2019 •

edited

Loading

adrianyorke commented Dec 14, 2019

teaglebuilt commented Dec 15, 2019

teaglebuilt commented Dec 15, 2019

teaglebuilt commented Dec 15, 2019

adrianyorke commented Dec 16, 2019

teaglebuilt commented Dec 17, 2019

NeoMorfeo commented Apr 29, 2020

adrianyorke commented Apr 29, 2020

NeoMorfeo commented Apr 29, 2020

adrianyorke commented Apr 29, 2020

NeoMorfeo commented Jun 24, 2020

teaglebuilt commented Jun 24, 2020 •

edited

Loading

NeoMorfeo commented Jun 25, 2020

teaglebuilt commented Jun 25, 2020

NeoMorfeo commented Jun 26, 2020

teaglebuilt commented Jul 9, 2020

NeoMorfeo commented Jul 9, 2020

Enhancement Request: List keys (files) in S3 bucket using paginators #9

Enhancement Request: List keys (files) in S3 bucket using paginators #9

Comments

adrianyorke commented Dec 12, 2019

teaglebuilt commented Dec 14, 2019

teaglebuilt commented Dec 14, 2019

adrianyorke commented Dec 14, 2019 • edited Loading

adrianyorke commented Dec 14, 2019

teaglebuilt commented Dec 15, 2019

teaglebuilt commented Dec 15, 2019

teaglebuilt commented Dec 15, 2019

adrianyorke commented Dec 16, 2019

teaglebuilt commented Dec 17, 2019

NeoMorfeo commented Apr 29, 2020

adrianyorke commented Apr 29, 2020

NeoMorfeo commented Apr 29, 2020

adrianyorke commented Apr 29, 2020

NeoMorfeo commented Jun 24, 2020

teaglebuilt commented Jun 24, 2020 • edited Loading

NeoMorfeo commented Jun 25, 2020

teaglebuilt commented Jun 25, 2020

NeoMorfeo commented Jun 26, 2020

teaglebuilt commented Jul 9, 2020

NeoMorfeo commented Jul 9, 2020

adrianyorke commented Dec 14, 2019 •

edited

Loading

teaglebuilt commented Jun 24, 2020 •

edited

Loading