Anecdotally, we have noticed that restarting `START_REPLICATION` for each `SyncFlow`, or even having multiple sync flows, isn't ideal. We should always be reading from the slot if at all possible. A couple of approaches come to mind:
Option 1: ACK channel
One of the main reasons we have multiple sync flows is that after each `SyncFlow` we update the metadata on the destination to mark the last committed source LSN. This makes it durable for us to restart sync flows from that point, and to then flush that LSN to the source via a standby status message. We do this once per batch.
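For concreteness, here is a minimal sketch of what "flush via a standby status message" looks like, assuming the stream is driven with `github.com/jackc/pglogrepl` (the function name `flushCommittedLSN` is just illustrative, not an existing PeerDB helper):

```go
package cdc

import (
	"context"

	"github.com/jackc/pglogrepl"
	"github.com/jackc/pgx/v5/pgconn"
)

// flushCommittedLSN reports lsn back to the source so Postgres can advance the
// slot's confirmed_flush_lsn and recycle WAL behind it. This does not require
// tearing down the START_REPLICATION stream.
func flushCommittedLSN(ctx context.Context, conn *pgconn.PgConn, lsn pglogrepl.LSN) error {
	// Leaving WALFlushPosition/WALApplyPosition unset makes pglogrepl default
	// them to WALWritePosition.
	return pglogrepl.SendStandbyStatusUpdate(ctx, conn, pglogrepl.StandbyStatusUpdate{
		WALWritePosition: lsn,
	})
}
```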
One alternative approach could be to:
- once the destination gets `max_batch_size` rows, we update the metadata there.
- send the committed LSN back to `PullRecords` via a channel, and issue a flush (see the sketch below).
- we end up with just one sync flow this way.
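A rough sketch of that loop, using hypothetical `Slot`/`Destination` interfaces rather than PeerDB's actual types: a single long-lived `PullRecords`-style reader keeps draining the slot, and the destination sends the committed LSN back on an ack channel so the slot can be flushed without restarting `START_REPLICATION`.

```go
package cdc

import "context"

type LSN uint64
type Record struct{ Data []byte }

type Batch struct {
	Records []Record
	LastLSN LSN
}

// Slot abstracts the long-lived logical replication stream (hypothetical).
type Slot interface {
	Receive(ctx context.Context) (Record, LSN, error) // next change from START_REPLICATION
	Flush(ctx context.Context, lsn LSN) error         // standby status update
}

// Destination abstracts the sink and its metadata table (hypothetical).
type Destination interface {
	WriteBatch(ctx context.Context, b Batch) error
	UpdateLastSyncedLSN(ctx context.Context, lsn LSN) error
}

// pullRecords keeps reading from the slot; whenever the destination reports a
// durably committed LSN on acks, that LSN is flushed back to the slot.
func pullRecords(ctx context.Context, slot Slot, maxBatchSize int,
	batches chan<- Batch, acks <-chan LSN) error {
	var cur Batch
	for {
		select {
		case lsn := <-acks:
			if err := slot.Flush(ctx, lsn); err != nil {
				return err
			}
			continue
		default:
		}
		rec, lsn, err := slot.Receive(ctx)
		if err != nil {
			return err
		}
		cur.Records = append(cur.Records, rec)
		cur.LastLSN = lsn
		if len(cur.Records) >= maxBatchSize {
			batches <- cur
			cur = Batch{}
		}
	}
}

// syncBatches writes each batch, updates the destination metadata, and only
// then acks the LSN so the slot can advance.
func syncBatches(ctx context.Context, dst Destination, batches <-chan Batch, acks chan<- LSN) error {
	for b := range batches {
		if err := dst.WriteBatch(ctx, b); err != nil {
			return err
		}
		if err := dst.UpdateLastSyncedLSN(ctx, b.LastLSN); err != nil {
			return err
		}
		acks <- b.LastLSN
	}
	return nil
}
```

In practice the ack channel would need a small buffer (or `Receive` a deadline) so the writer never blocks waiting on a puller that is itself blocked on the stream, but the shape is the same: one `START_REPLICATION`, one sync flow, acks flowing backwards.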
Option 2: Landing Zone
This is a more drastic shift in how we currently do things. Have some kind of durable store (maybe Kafka or Redpanda). Always read from the slot and push into this landing zone, so the slot is always being drained. Periodically push from the landing zone to the destination. This idea needs to be fleshed out in more detail.
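A very rough sketch of the shape this could take, again with hypothetical interfaces (`DurableLog` standing in for a Kafka/Redpanda topic); the point is that draining the slot and loading the destination become two independent loops:

```go
package cdc

import (
	"context"
	"time"
)

type ChangeEvent struct {
	LSN  uint64
	Data []byte
}

// ReplicationStream is the long-lived START_REPLICATION reader (hypothetical).
type ReplicationStream interface {
	Receive(ctx context.Context) (ChangeEvent, error)
	Flush(ctx context.Context, lsn uint64) error // standby status update
}

// DurableLog is the landing zone, e.g. a Kafka/Redpanda topic (hypothetical).
type DurableLog interface {
	Append(ctx context.Context, ev ChangeEvent) error // durable once it returns
	ReadFrom(ctx context.Context, offset int64) (events []ChangeEvent, nextOffset int64, err error)
}

// Sink is the eventual destination plus its offset metadata (hypothetical).
type Sink interface {
	WriteBatch(ctx context.Context, evs []ChangeEvent) error
	CommitOffset(ctx context.Context, offset int64) error
}

// drainSlot always reads from the slot; the slot LSN can be flushed as soon as
// the landing zone has durably acknowledged the event.
func drainSlot(ctx context.Context, stream ReplicationStream, log DurableLog) error {
	for {
		ev, err := stream.Receive(ctx)
		if err != nil {
			return err
		}
		if err := log.Append(ctx, ev); err != nil {
			return err
		}
		if err := stream.Flush(ctx, ev.LSN); err != nil {
			return err
		}
	}
}

// loadSink periodically moves events from the landing zone to the destination,
// independent of how fast the slot is being drained.
func loadSink(ctx context.Context, log DurableLog, sink Sink, every time.Duration) error {
	var offset int64
	ticker := time.NewTicker(every)
	defer ticker.Stop()
	for {
		select {
		case <-ctx.Done():
			return ctx.Err()
		case <-ticker.C:
			evs, next, err := log.ReadFrom(ctx, offset)
			if err != nil {
				return err
			}
			if len(evs) == 0 {
				continue
			}
			if err := sink.WriteBatch(ctx, evs); err != nil {
				return err
			}
			if err := sink.CommitOffset(ctx, next); err != nil {
				return err
			}
			offset = next
		}
	}
}
```

One consequence of this split is that once the landing zone owns durability, the slot can be flushed aggressively, so WAL doesn't pile up even when the destination is slow; durability of not-yet-loaded changes shifts to the log.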
Just make sure that `START_REPLICATION` doesn't incur a long-running transaction and block autovacuum. I am not sure whether `START_REPLICATION` behaves like a regular Postgres transaction.
@serprex for thoughts.
cc: @saisrirampur