Duplicate exps and exp naming #7879
-
The same set of parameters will still try to generate a duplicate experiment but then fail at the end, right? I think this is the worst option and it always seems inconsistent and surprising to me (sometimes I still end up with duplicated experiments). Either we should fail to queue/start the experiment (with an exception for checkpoints), or we should allow duplicates.
-
Yes, I also noticed this, and I think we need a better way to handle checkpoint exps. On the queue side, they are regarded as a single experiment when the worker runs them. But on the
I don't quite understand why
I think it is OK.
We can check for naming duplication before the experiment is started.
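To make the "check before start" idea concrete, here is a minimal sketch. It assumes the name can be derived from the parameters alone, which is not how names are generated today (they are based on the pipeline result); `derive_name`, `check_before_start`, and `DuplicateExperimentError` are hypothetical names for illustration, not DVC APIs.

```python
import hashlib
import json


class DuplicateExperimentError(Exception):
    """Raised when an experiment with the same derived name already exists."""


def derive_name(params: dict) -> str:
    # Hypothetical naming scheme (an assumption, not DVC's real one):
    # hash the parameters in a stable key order and keep a short prefix.
    payload = json.dumps(params, sort_keys=True).encode()
    return "exp-" + hashlib.sha256(payload).hexdigest()[:5]


def check_before_start(params: dict, existing_names: set,
                       is_checkpoint: bool = False) -> str:
    """Fail early on a duplicate name instead of failing at the end of the run."""
    name = derive_name(params)
    # Checkpoint experiments are exempt, as suggested earlier in the thread.
    if name in existing_names and not is_checkpoint:
        raise DuplicateExperimentError(name)
    return name
```

With this shape, a second run with the same parameters would be rejected at queue/start time rather than after the whole pipeline has been executed.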
-
This is also related to the bug where generated final exp names for checkpoint exps can't be used with `queue kill` and `queue remove` (the queued exp isn't associated with the separate generated exp name).
We probably need to just revisit automatic exp naming. The simplest solution here would be to generate random exp names when they are queued, instead of using the current method of naming exps based on the pipeline result. However, this means we would lose the "prevent duplicate exps" behavior which we have now (but in previous discussions it's been raised that this may not be desirable behavior in the first place).
Basically, if a user runs `exp run` twice with the same set of parameters, is it OK for us to generate two separate experiments in Git (even though they have the same result)? On the DVC side we would still be using run-cache, so the subsequent runs wouldn't be fully reproduced/duplicated. But `exp show`, `queue status`, etc. would all show two entries.
If having duplicate exps is acceptable, then we can just auto-generate names right away, so queued exp names will always match the final ones.
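A rough sketch of the trade-off between the two naming strategies, for illustration only. Every function name here is hypothetical, and hashing the parameters is just a stand-in for "name based on the pipeline result"; it is not how DVC actually computes names.

```python
import hashlib
import json
import uuid


def random_name() -> str:
    # Name chosen at queue time, independent of params or results.
    return "exp-" + uuid.uuid4().hex[:8]


def result_based_name(params: dict) -> str:
    # Stand-in for result-based naming: identical inputs give identical names.
    payload = json.dumps(params, sort_keys=True).encode()
    return "exp-" + hashlib.sha256(payload).hexdigest()[:8]


def queue_with_random_names(runs: list) -> dict:
    """Assign each run a name when it is queued; that name is also the final one."""
    queued = {}
    for params in runs:
        name = random_name()
        while name in queued:  # regenerate on the unlikely collision
            name = random_name()
        queued[name] = params
    return queued


def finalize_with_result_names(runs: list) -> dict:
    """Name each run only after it finishes; identical runs collapse to one name."""
    return {result_based_name(params): params for params in runs}
```

Queuing the same parameters twice gives two entries under random naming (so commands that take a name can target either one from the moment it is queued), and a single entry under result-based naming (duplicates are prevented, but the final name is not known while the exp is still queued).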
Originally posted by @pmrowla in #7592 (comment)