-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature/ecs paths mapping script #2197
Conversation
…related mapping tables will not be path-changed.
…ping-service-url`. `-f/--fields-to-map=[hdfsPath|hdfsPublishPath|all(default)]` param has been added. - there is now no `pathChanged` mark in the DB, instead - the -s/--skip-prefix value is check to determined if path-changed or not.
… ECS service mapping check added; exception communicates the HDFS path used
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- pull
- code review
- manually test
… and default to usage `-s s3a:// /tmp`
Skip prefixes feature has been updated:
|
SonarCloud Quality Gate failed. 0 Bugs No Coverage information Catch issues before they fail your Quality Gate with our IDE extension SonarLint |
Merging - tested internally. Jenkins build bears no relevance here, this is a separate migration Python script |
This PR adds a script to remap hdfs paths based on a service response. Primary usage is for hdfs to ECS migration (with defaults set for this purpose), but the script is general in nature.
Naively Dev-Tested on local mongoDB.
When reading and thinking that some parts have no relation to this script (e.g.
migration_free_only=False
), note, that this script reuses a lot of the siblingmigrate_menas.py
- that being the reason.Examples
Help with params overview:
Example run for datasets
DM9_actn_Cd
andDM9_cnsmr_accnt_Sttlmnt
-d
- dataset-t
- target db-u
- mapping service URL-o
- only map datasets, not related mapping tables-f hdfsPublishPath
- onlyhdfsPublishPath
field will get path-changed (sohdfsPath
will be kept as-is).Example run for dataset
XMSK083
- has mapping table ties:-d
- dataset-t
- target db-u
mapping service URL-n
dryrun (just print)-v
verbose