mixed case support for CDC mirrors in Snowflake and Postgres #589

heavycrystal · 2023-10-29T13:28:43Z

No description provided.

serprex · 2023-11-07T17:04:49Z

flow/connectors/postgres/client.go

@@ -497,6 +497,7 @@ func (c *PostgresConnector) generateFallbackStatements(destinationTableIdentifie
 		}
 	}
 	flattenedCastsSQL := strings.TrimSuffix(strings.Join(flattenedCastsSQLArray, ","), ",")
+	parsedDstTable, _ := utils.ParseSchemaTable(destinationTableIdentifier)


put this at top before normalizedTableSchema would probably run into a nil dereference & handle errors

serprex · 2023-11-07T17:07:34Z

flow/connectors/postgres/client.go

@@ -529,6 +530,7 @@ func (c *PostgresConnector) generateMergeStatement(destinationTableIdentifier st
 	for i, columnName := range columnNames {
 		columnNames[i] = fmt.Sprintf("\"%s\"", columnName)
 	}
+	parsedDstTable, _ := utils.ParseSchemaTable(destinationTableIdentifier)


On one level it feels like this could be made more efficient with a utils.SanitizeSchemaTable where it'd avoid allocations in case where destinationTableIdentifier == parsedDstTable.String() but in another level it seems like you're already too late here if for stranger names like those containing a period. But maybe character stripping avoids that for now

func SanitizeSchemaTable(dirty string) { var sb strings.Builder sb.Grow(len(dirty) + 4) sb.WriteRune('"') sb.WriteString(strings.Replace(dirty, ".", "\".\"", 1)) sb.WriteRune('"') return sb.String() }

Something like this. Would have to profile to see if it actually brings any benefit. Probably over optimizing. This draft also fails to include the return dirty if dirty matches [a-zA-Z0-9.]+ logic

serprex · 2023-11-07T17:24:09Z

flow/connectors/snowflake/client.go

+	// https://www.alberton.info/dbms_identifiers_and_case_sensitivity.html
+	// Snowflake follows the SQL standard, but Postgres does the opposite.
+	// Ergo, we suffer.
+	if strings.ToLower(identifier) == identifier {


A utils.IsLower(string) / utils.IsUpper(string) could be implemented using unicode package's IsLetter / IsUpper / IsLower

https://pkg.go.dev/unicode

serprex · 2023-11-07T17:42:16Z

flow/connectors/snowflake/qrep_avro_consolidate_handler.go

+		}
+	}
+
+	switch appendMode {


Suggested change

switch appendMode {

if appendMode {

… and Postgres

serprex reviewed Nov 7, 2023

View reviewed changes

heavycrystal added 2 commits November 27, 2023 14:37

mixed case table and column name support for CDC mirrors in Snowflake…

1197c3e

… and Postgres

fixing tests pt.1

7920d15

heavycrystal force-pushed the pg-sf-mixed-case branch from 1834283 to 7920d15 Compare November 27, 2023 09:10

heavycrystal closed this Feb 28, 2024

serprex deleted the pg-sf-mixed-case branch July 19, 2024 15:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mixed case support for CDC mirrors in Snowflake and Postgres #589

mixed case support for CDC mirrors in Snowflake and Postgres #589

heavycrystal commented Oct 29, 2023

serprex Nov 7, 2023

serprex Nov 7, 2023

serprex Nov 7, 2023 •

edited

Loading

serprex Nov 7, 2023

serprex Nov 7, 2023

mixed case support for CDC mirrors in Snowflake and Postgres #589

mixed case support for CDC mirrors in Snowflake and Postgres #589

Conversation

heavycrystal commented Oct 29, 2023

serprex Nov 7, 2023

Choose a reason for hiding this comment

serprex Nov 7, 2023

Choose a reason for hiding this comment

serprex Nov 7, 2023 • edited Loading

Choose a reason for hiding this comment

serprex Nov 7, 2023

Choose a reason for hiding this comment

serprex Nov 7, 2023

Choose a reason for hiding this comment

serprex Nov 7, 2023 •

edited

Loading