feat: expose resetting run boundaries #112

drake-nominal · 2024-10-29T22:02:24Z

When creating a run, a user is required to enter in the start / end timestamps before:

All datasets are added to the run
All data is present in the datasets being added to the run (e.g. a dataset is mid ingest via multi-file or may otherwise be updated in the future)

As a result, customers can end up in a situation where the start / end bounds of a run aren't particularly accurate as data continues to be ingested into the platform, and having a simple way to just "reset" the bounds turns out to be powerful.

alkasm · 2024-10-29T22:10:25Z

All data is present in the datasets being added to the run (e.g. a dataset is mid ingest via multi-file or may otherwise be updated in the future)

IIUC this PR doesn't fix this issue? The datasets that are still mid-ingest are filtered out.

I'd like to minimize non-idempotent mutations as first-class functionality in the lib.

drake-nominal · 2024-10-29T22:32:02Z

IIUC this PR doesn't fix this issue? The datasets that are still mid-ingest are filtered out.

@alkasm in a single file world, sure, but this is a pretty rare edge case in the long run. Consider the case where the customer has 50000 files that compose one of the datasets instead-- now "mid ingest" can still mean that the dataset shows up as "ingested" in product. Or perhaps new files get added later after the run is created.

…ssing dataset bounds

drake-nominal requested a review from alkasm October 29, 2024 22:02

drake-nominal self-assigned this Oct 29, 2024

drake-nominal force-pushed the deidukas/reset-run-bounds branch from b875393 to 9bde0f6 Compare October 29, 2024 22:28

drake-nominal force-pushed the deidukas/add-dataset-bounds branch from adcab95 to 2aac0a0 Compare October 29, 2024 22:45

Base automatically changed from deidukas/add-dataset-bounds to main October 29, 2024 23:18

drake-nominal added 2 commits November 15, 2024 12:53

Add method to reset start & end boundaries on a run

efc6632

Make resetting run boundaries more graceful / no-op in the case of mi…

35830ab

…ssing dataset bounds

drake-nominal force-pushed the deidukas/reset-run-bounds branch from 9bde0f6 to 35830ab Compare November 15, 2024 21:01

drake-nominal and others added 4 commits November 15, 2024 13:26

Add warning

d6e301b

Formatting

4301d8b

Merge branch 'main' into deidukas/reset-run-bounds

a5187e9

Merge branch 'main' into deidukas/reset-run-bounds

571cb1c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: expose resetting run boundaries #112

feat: expose resetting run boundaries #112

drake-nominal commented Oct 29, 2024

alkasm commented Oct 29, 2024 •

edited

Loading

drake-nominal commented Oct 29, 2024 •

edited

Loading

feat: expose resetting run boundaries #112

Are you sure you want to change the base?

feat: expose resetting run boundaries #112

Conversation

drake-nominal commented Oct 29, 2024

alkasm commented Oct 29, 2024 • edited Loading

drake-nominal commented Oct 29, 2024 • edited Loading

alkasm commented Oct 29, 2024 •

edited

Loading

drake-nominal commented Oct 29, 2024 •

edited

Loading