Now that we have created 2 pipelines:
- GetTableList-Trigger-Incremental-Copy
- GetTableList-Trigger-sp
How would we automate them? What factors do we need to account for?
In this challenge, your team will discuss and whiteboard various automation techniques, along with the factors that need to be accounted for when productionizing an incremental pipeline.
Be able to articulate the options for automating these pipelines and the factors that need to be accounted for, including:
- Change Data Capture on the source data.
- What is the difference between a watermark table and checking for the last "x" minutes? What are the pros and cons of each?
- How would we factor in timing and determine when to update the production tables in the Dedicated SQL Pool?
- What is the impact on downstream services such as reporting?
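
To make the watermark versus last "x" minutes discussion concrete, here is a minimal Python sketch that simulates both filters over a few in-memory rows. The row data, timestamps, and function names are all hypothetical and are not taken from the pipelines above; the sketch only illustrates one failure mode your team may want to whiteboard, which is a skipped or failed run.

```python
from datetime import datetime, timedelta

# Hypothetical source rows as (id, last_modified); not real pipeline data.
ROWS = [
    (1, datetime(2024, 1, 1, 10, 0)),
    (2, datetime(2024, 1, 1, 10, 20)),
    (3, datetime(2024, 1, 1, 10, 50)),
]


def rows_since_watermark(rows, watermark):
    """Return rows modified after the stored watermark, plus the new watermark."""
    changed = [r for r in rows if r[1] > watermark]
    # The watermark only advances to the newest row actually picked up.
    new_watermark = max((r[1] for r in changed), default=watermark)
    return changed, new_watermark


def rows_in_last_minutes(rows, now, minutes):
    """Return rows modified within a fixed lookback window ending at `now`."""
    cutoff = now - timedelta(minutes=minutes)
    return [r for r in rows if r[1] > cutoff]


# Assumed scenario: runs are scheduled every 30 minutes, the 10:30 run failed,
# and the next run starts at 11:00.
watermark = datetime(2024, 1, 1, 10, 0)  # high-water mark of the last successful load
now = datetime(2024, 1, 1, 11, 0)

by_watermark, new_watermark = rows_since_watermark(ROWS, watermark)
by_window = rows_in_last_minutes(ROWS, now, 30)

print("watermark picks up ids:", [r[0] for r in by_watermark])
print("30-minute window picks up ids:", [r[0] for r in by_window])
print("next watermark:", new_watermark)
```

In this scenario the watermark approach recovers the row changed at 10:20 because it resumes from the last successful load, while the fixed window silently misses it. The trade-off is that the watermark needs a table that is updated reliably after each load, whereas the fixed window needs no stored state.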
The following links may be useful in achieving the success criteria listed above.
Too comfortable? Eager to do more? Try these additional challenges!
- Create one or more scheduled trigger(s) to automate both pipelines above.