Support reading the credentials from the ~/.aws/credentials file within the Spark cluster #123

julienrf · 2024-03-30T14:03:16Z

Currently, the users of the migrator have to provide their AWS credentials through the config.yaml file. According to the description of #122, in some cases it is desirable to also read the AWS credentials from the user profile (~/.aws/credentials).

This PR addresses this need by using a capability of the Hadoop connector to set a custom AWS credentials provider. We set it to com.amazonaws.auth.profile.ProfileCredentialsProvider, which is the standard profile credentials provider from the AWS SDK.
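To make concrete what a profile credentials provider reads, here is a minimal Python sketch. It is not the migrator's code: the PR delegates this work to `ProfileCredentialsProvider`. The sketch assumes the usual INI layout of `~/.aws/credentials` (a `[default]` section holding `aws_access_key_id` and `aws_secret_access_key`); the function name `read_profile_credentials` is hypothetical.

```python
import configparser
from pathlib import Path
from typing import Optional, Tuple


def read_profile_credentials(
    path: Path, profile: str = "default"
) -> Optional[Tuple[str, str]]:
    """Return (access_key_id, secret_access_key) for a profile, or None.

    Hypothetical helper for illustration only; the AWS SDK provider
    handles more cases (session tokens, alternative file locations).
    """
    # Disable interpolation so a '%' in a secret is read literally.
    parser = configparser.ConfigParser(interpolation=None)
    # ConfigParser.read silently skips files that do not exist.
    parser.read(path)
    if not parser.has_section(profile):
        return None
    section = parser[profile]
    access_key = section.get("aws_access_key_id")
    secret_key = section.get("aws_secret_access_key")
    if access_key is None or secret_key is None:
        return None
    return access_key, secret_key
```

Calling `read_profile_credentials(Path.home() / ".aws" / "credentials")` returns `None` when the file or the profile is missing, which is the case where a caller would need another source of credentials.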

Ultimately, if necessary, we could make this configurable and let our users choose which credentials provider to use. This would give them full control over how credentials are resolved.

According to #122 (comment), this PR fixes #122.

julienrf · 2024-04-01T15:16:26Z

On second thought, I wonder if we should also support reading credentials from environment variables (see also awslabs/emr-dynamodb-connector#185).

tarzanek · 2024-04-02T08:45:11Z

The ideal way is to read from everywhere with some order of preference (and stick to the Spark way of doing it, https://spark.apache.org/docs/latest/configuration.html#dynamically-loading-spark-properties, or something similar with a predefined hierarchy and inheritance).
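The "order of preference" idea can be sketched as a list of sources tried in turn, where the first one that yields credentials wins. This is only an illustration under assumptions, not a design the thread agreed on: the ordering (explicit configuration first, then environment variables) and the config keys `accessKey` and `secretKey` are hypothetical, while `AWS_ACCESS_KEY_ID` and `AWS_SECRET_ACCESS_KEY` are the standard AWS environment variable names.

```python
import os
from typing import Callable, Mapping, Optional, Sequence, Tuple

Credentials = Tuple[str, str]
Source = Callable[[], Optional[Credentials]]


def from_config(config: Mapping[str, str]) -> Source:
    """Source for explicitly configured credentials (hypothetical keys)."""
    def load() -> Optional[Credentials]:
        access_key = config.get("accessKey")
        secret_key = config.get("secretKey")
        if access_key and secret_key:
            return access_key, secret_key
        return None
    return load


def from_environment(env: Mapping[str, str] = os.environ) -> Source:
    """Source for the standard AWS environment variables."""
    def load() -> Optional[Credentials]:
        access_key = env.get("AWS_ACCESS_KEY_ID")
        secret_key = env.get("AWS_SECRET_ACCESS_KEY")
        if access_key and secret_key:
            return access_key, secret_key
        return None
    return load


def resolve(sources: Sequence[Source]) -> Optional[Credentials]:
    """Return credentials from the first source that provides them."""
    for source in sources:
        credentials = source()
        if credentials is not None:
            return credentials
    return None
```

With `resolve([from_config(settings), from_environment()])`, explicitly configured values take precedence and the environment is only consulted when they are absent. Further sources, such as a profile file or an assumed role, would be added to the list at the position matching their priority.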

tarzanek · 2024-04-02T08:55:43Z

@julienrf just fix the commit message: add a reference to #122 (but don't close it)

I'd like to merge this to fix the credentials to the ones provided by config.yaml

Then in later PRs we can tackle env or assumed role approaches for DynamoDB access

tarzanek · 2024-04-02T11:44:53Z

merged by 00412dc
closing

Add the default ProfileCredentialsProvider to the credentials provide…

1162ac6

…r chain

julienrf changed the title ~~Add the default ProfileCredentialsProvider to the credentials provider chain~~ Support reading the credentials from the ~/.aws/credentials file within the Spark cluster Apr 1, 2024

julienrf marked this pull request as ready for review April 1, 2024 12:15

tarzanek mentioned this pull request Apr 2, 2024

DynamoDB migration is unable to read credentials. #122

Open

tarzanek closed this Apr 2, 2024

julienrf deleted the aws-credentials branch April 18, 2024 08:31
