Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Glue Schema Registry schema replication converter #352

Open
wants to merge 49 commits into
base: master
Choose a base branch
from

Conversation

srakshit
Copy link

@srakshit srakshit commented Jul 15, 2024

Issue #, if available:
Replicate GSR schema and schema versions from one region to another

Description of changes:
Written a new Kafka Converter to replicate schema and schema versions cross-region

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Jessie Chen and others added 30 commits August 17, 2023 13:20
…ation-mm2-converter

New cross region replication mm2 converter
…rsions of the existing methods to ensure source schema compatibility with target schema
…2 versions of the existing methods to ensure source schema compatibility with target schema
Subham Rakshit added 8 commits July 9, 2024 16:59
… versions of the existing methods to ensure source schema compatibility with target schema
…ixed issue with how default configs are set with TARGET specific configs in AWSGlueCrossRegionSchemaReplicationConverter and GlueSchemaRegistryConfiguration
import software.amazon.awssdk.services.glue.model.RegisterSchemaVersionResponse;
import software.amazon.awssdk.services.glue.model.RegistryId;
import software.amazon.awssdk.services.glue.model.SchemaId;
import software.amazon.awssdk.services.glue.model.*;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We avoid using * imports in this code base. Can you please change this back?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed

glueSchemaRegistryConfiguration.getSourceRegion() != null &&
glueSchemaRegistryConfiguration.getSourceEndPoint() != null) {

glueSourceClientBuilder = GlueClient
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Instead of creating multiple clients here, can we pass in a region and create multiple instances of this client in different regions?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

With the new code, no change is required in this section.

* @return Map of SchemaV2 with VersionIds
* @throws AWSSchemaRegistryException on any error during the schema creation
*/
public Map<SchemaV2, UUID> createSchemaV2(String schemaName,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why SchemaV2?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed. Not needed with the new code.


import java.util.Map;
import java.util.UUID;
import java.util.*;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same as above on imports.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Restored back to original with no changes

*/
@AllArgsConstructor
@Value
public class SchemaV2 {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we re-use the existing Schema class?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed. Not needed with the new code.

/**
* Target Registry Name.
*/
public static final String TARGET_REGISTRY_NAME = "target.registry.name";
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same as above. Converter specific code must be in converter.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed. Moved this to converter module


private void validateRequiredConfigsIfPresent(Map<String, ?> configs) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is it possible to re-use any existing configs here?

import java.util.stream.Collectors;

@Slf4j
public class KafkaHelper {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't this already present?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Its there for Kafka related GSR tests but I had to modify it for schema replication tests. So preferred to create a copy of it.

out.log Outdated
@@ -0,0 +1 @@
nohup: ./run-local-tests.sh: No such file or directory
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we remove this file?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

removed

@@ -60,6 +63,9 @@ public class GlueSchemaRegistryDeserializationFacade implements Closeable {
@VisibleForTesting
protected LoadingCache<UUID, Schema> cache;

@VisibleForTesting
protected LoadingCache<UUID, SchemaV2> cacheV2;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can this be removed if we can figure out how to re-use Schema?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Removed. Not needed with the new code.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants