Reuse search results when given a partially filled directory #98

kmaziarz · 2024-08-22T12:46:19Z

When running search on many targets using preemptible/unreliable compute, it may happen that the run fails having completed a fraction of targets. If such a run is restarted, it will generate a new timestamped output directory and start from scratch. This PR adds a flag append_timestamp_to_dir, which when set to False turns off timestamping, meaning that a restarted run would use the same directory as the old one. On top of this, it implements logic to skip targets which were completed earlier; a target that was partially completed (for example the search statistics were saved but it crashed when dumping the graph) will be purged and recomputed. Finally, the PR also includes a few small tweaks to the surrounding code: adding guards around visualization imports (to make sure search can be ran with the base environment without graphviz) and deleting an __init__.py file in an old empty directory.

syntheseus/cli/search.py

AustinT

LGTM, except for my comment under Marwin's

…otting is enabled

kmaziarz added 7 commits August 21, 2024 10:33

feat(search): Add flag to control timestamping of output directory name

ca68b35

feat(search): Resume from failure if output dir is not empty

2aabe05

test(search): Add tests for resuming after failure

fc882b1

fix(search): Make search runnable without graphviz

f2cc00b

chore(search): Shorten import

54bf0ed

chore(reaction_prediction): Remove empty subpackage

fdbe69b

doc(CHANGELOG): Add an entry for #98

82027f7

kmaziarz requested review from fiberleif, AustinT and mrwnmsr August 22, 2024 12:46

mrwnmsr reviewed Aug 22, 2024

View reviewed changes

syntheseus/cli/search.py Show resolved Hide resolved

AustinT approved these changes Aug 23, 2024

View reviewed changes

feat(search): Raise an error if viz dependencies are not found but pl…

c07f274

…otting is enabled

kmaziarz merged commit 1c8ef7a into main Aug 23, 2024
5 checks passed

kmaziarz deleted the kmaziarz/resume-after-failure branch August 23, 2024 13:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reuse search results when given a partially filled directory #98

Reuse search results when given a partially filled directory #98

kmaziarz commented Aug 22, 2024

AustinT left a comment

Reuse search results when given a partially filled directory #98

Reuse search results when given a partially filled directory #98

Conversation

kmaziarz commented Aug 22, 2024

AustinT left a comment

Choose a reason for hiding this comment