-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add decent error recovery and Ctrl-C handling #14
Comments
@yarikoptic Is your suggested "analysis" step intended to be run over the entire file tree before doing anything? That would involve storing every single key in memory at once, which would likely be problematic. |
Do you think it would be too tricky to allow for two modes -- "staged" (the "analysis" based above) and "interleaved"? I guess if was in Python could be at the level of My thinking: we would need a full heavy list only in the initial backup. That is where we can proceed in "interleaved" mode, just getting a new key from the inventory, checking if present locally and if not -- yield the "action" to do to the next step. In "staged" -- it would establish the full list of actions to tackle first, and since changes should not be multitude if ran frequently -- should fit into memory easily IMHO. |
@yarikoptic That seems doable, though I really don't like the names "interleaved" and "staged". What about "immediate" and "pre-planned"? I also have to point out that, based on my observations of previous runs of the program, the "staged"/"pre-planned" mode, when run on the dandiarchive inventories, will have to spend about 10 and a half hours just going over all the items before it gets around to actually doing anything. In "staged"/"pre-planned" mode, when cleaning up after an error or Ctrl-C, do you want the actions performed so far to be rolled back or not? (Just what is the How do you expect removal of files from the local backup that don't correspond to any items in the inventories (#33) to work? Determining whether a file should be deleted basically requires storing all the CSV entries in memory at once. |
I have no strong opinion on names. "immediate" and "pre-planned" sound good.
I wonder why and either could be avoided e.g. via parallelization .
sorry for likely using inappropriate terminology -- I often use
Nothing 100% solid idea yet, but followed up on #33 (comment) |
Because there are over 400 CSVs to process, each with 300,000 lines, and they're already being processed concurrently (see the You didn't answer this question:
|
sure -- lots of work... do we know what it spends most of the time on? I am thinking about the
I guess so since we do need to be able to get into a "consistent state". |
@yarikoptic I'm going to repeat my suggestion from an earlier comment:
We know the program can do (b), and (a) is possible as long as the program was allowed to clean up and wasn't killed by SIGTERM or SIGKILL.1 Thus, I suggest that we don't implement any clever rollbacks on errors and instead just leave the error handling as-is. Footnotes
|
I wonder if we could/should at least get some |
@yarikoptic What exactly would the goal of that be? Why must an incomplete run only be followed by a run with the same options? |
I felt like for what you formulated as
but I guess indeed no need to be the same -- it might then just "update" it to a new state. But overall, I would just like to have an opportunity to know that prior run was not completed ok. |
@yarikoptic How about something like this: We add an
Then, if |
Prior discussion from dandi/dandi-utils#3:
jwodder:
yarikoptic:
jwodder:
yarikoptic:
jwodder:
yarikoptic:
jwodder:
yarikoptic:
jwodder:
yarikoptic:
The text was updated successfully, but these errors were encountered: