Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spark 3.1.1 copy missing data #101

Closed

Conversation

Rumbles
Copy link

@Rumbles Rumbles commented Aug 9, 2023

A quick attempt at adding a method to copy missing rows

Links to #100

@@ -139,10 +139,63 @@ object Validator {

val failures = runValidation(migratorConfig)

if (failures.isEmpty) log.info("No comparison failures found - enjoy your day!")
else {
if (failures.isEmpty) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

don't forget that
config.validation.failuresToFetch
will limit above failures
so figuring out proper https://github.com/scylladb/scylla-migrator/blob/master/config.yaml.example#L217 is the limit for this list

log.error("Found the following comparison failures:")
log.error(failures.mkString("\n"))

// Copy missing data here based on the discrepancies found
copyMissingData(migratorConfig, failures)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

since this doesn't partition it will work only for small / tiny amount of rows

@Rumbles Rumbles closed this Sep 6, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants