Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consolidate s3 key config options #5

Closed
andrewcstewart opened this issue May 28, 2021 · 3 comments · Fixed by #12
Closed

Consolidate s3 key config options #5

andrewcstewart opened this issue May 28, 2021 · 3 comments · Fixed by #12

Comments

@andrewcstewart
Copy link
Collaborator

There are currently three different config fields related to s3 data location keys:

  • s3_key_prefix - Used in defining target_key for s3 upload, and used in data_locatioin for athena table definition.
  • naming_convention - Used only for defining target_key for s3 upload.
  • s3_staging_dir - Not being used in the sink function, but apparently still set as 'required' somewhere. (will fail CLI if not set)

We can probably consolidate these into at least one config option.

@andrewcstewart
Copy link
Collaborator Author

@aaronsteers - I'm personally of the opinion that we could just do away with naming_convention.. I think overly complicates key construction and doesn't particularly add any value to the user.

@aaronsteers
Copy link

@andrewcstewart

I'm personally of the opinion that we could just do away with naming_convention.

Agreed.

s3_staging_dir - Not being used in the sink function, but apparently still set as 'required' somewhere. (will fail CLI if not set)

I was confused about that also. This seemed to be implying we were needing multiple directories. Let's remove if not needed.

naming_convention - Used only for defining target_key for s3 upload.

I think we can remove for now, and re-add when/if we introduce path-based partitioning in #15.

@aaronsteers
Copy link

Bringing bucket back into this conversation, what if we standardize on just these two:

  • s3_bucket - The bucket.
  • s3_key_prefix - The 'prefix' indicator makes it clear to me that there will be multiple auto-generated files under this path, and the exact file names are not controlled via config. (At least not until we get to Add configurable partitioning logic in S3 #15, but even then we may use automatic name generation based on higher-order config.)

What do you think?

@andrewcstewart andrewcstewart linked a pull request Jun 21, 2021 that will close this issue
13 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants