Enhance stability of minimization #557

jfennick · 2022-03-09T02:26:09Z

Description

I have been experiencing some instabilities which, after much difficulty, I was able to track down to the minimizer. The observed behavior is that the minimizer will finish 'successfully', but some time later, during equilibration and/or propagating replicas, the simulation may or may not randomly crash with NaNs or even throw CUDA_ERROR_ILLEGAL_ADDRESS (700).

Nearly identical behavior can be found here openmm/openmm#3414. In that ticket, the issue was that the box vectors were not being updated correctly. For stability, during minimization the box vectors should not be allowed to change at all. Subsequent equilibration at NPT should be used to adjust the box vectors.

In my case(s), the box vectors were changing only at the fourth significant figure, and yet that was sufficient to (usually) crash the simulations. Inserting Gradient Descent before FIRE seems to greatly ameliorate but not completely eliminate the instabilities. Replacing FIRE with L-BFGS (with or without Gradient Descent) also seems to fix the instabilities.

This PR adds three changes to improve the stability of minimization:

First and foremost, it temporarily disables the barostat during minimization to keep the box vectors fixed. This is probably the only change that is absolutely necessary to fix the instabilities.
It inserts a very short (24 timestep) Gradient Descent minimization before the main FIRE minimization. Gradient Descent is not asymptotically fast, but it is excellent for preconditioning a 'better' minimizer.
I have replaced FIRE with L-BFGS. Unlike FIRE, L-BFGS does not modify the box vectors (even when the pressure is not None) and it's a fine minimizer anyway.

Todos

Implement feature / fix bug
Add tests
Update documentation as needed
Update changelogNotable points that this PR has either accomplished or will accomplish.

Status

Ready to go

jfennick · 2022-03-09T02:28:06Z

See also choderalab/yank#1267

mikemhenry · 2022-03-09T18:21:42Z

@jfennick Thanks for this! I'll have @jchodera take a look!

jfennick · 2022-03-09T18:27:36Z

FYI I just realized that I accidentally named this branch stable_equilibration instead of stable_minimization.

jchodera

@ijpulidos : This is a great step forward, but there are a few risky things in here we should address before pulling it into a bugfix release. I'll look into this in more detail.

jchodera · 2022-04-04T20:46:41Z

openmmtools/multistate/multistatesampler.py

+        thermodynamic_state.pressure = None
+
+        # Use Gradient Descent first for numerical stability
+        integrator_grad = GradientDescentMinimizationIntegrator()


The initial step size here is not guaranteed to be robust. I wonder if there's a way we can automatically select a more robust initial step size, or if this is not a critical issue.

In some of my other work with gromacs, I use emstep 0.0001 (nanometers) which is 10 times smaller than the openmmtools default of 0.01 Angstroms. gromacs also has an adaptive step size, and I have some plots showing that starting with 0.001 Angstroms, the adaptive step size increases for the first ~25 iterations before stabilizing. In other words, you're only 'wasting' at most ~25 iterations, which isn't even a rounding error in the total runtime. Of course, those iterations are well spent if they can prevent minimization failures.

jchodera · 2022-04-04T20:48:58Z

openmmtools/multistate/multistatesampler.py

+
+        # Use Gradient Descent first for numerical stability
+        integrator_grad = GradientDescentMinimizationIntegrator()
+        context_grad = thermodynamic_state.create_context(integrator_grad)


It's good that we're creating the context and then cleaning it up again, since we don't repeatedly re-use the minimization context, but it would be useful if we also specified platform properties to ensure we use mixed precision if needed for the propagation or energy computation context caches.

jfennick · 2022-05-27T20:41:07Z

I'm not sure why this is failing CI.
pytest openmmtools/tests/test_sampling.py -k 'test_minimize'
works on my machine.

jfennick · 2022-06-03T18:33:13Z

Are there any thoughts on this PR? Again, disabling the barostat is really the only thing that is critical, which is why I originally separated it out into it's own 3-line commit. Perhaps it was a mistake to add additional enhancements. Can we at least merge the pressure bug in the first commit?

ijpulidos · 2022-06-03T18:51:44Z

@jfennick Thanks for your contributions. I'm tagging @jchodera for him to take another look into these, sine we need to see if the previously raised concerns are solved. I will also review this in detail, since this can potentially change many things in our calculations. For now I'm marking this to be released on our 0.22 milestone.

ijpulidos · 2023-03-28T15:17:12Z

Resolves #668

jfennick mentioned this pull request Mar 9, 2022

Stable minimization choderalab/yank#1267

Closed

jfennick changed the title ~~Stable equilibration~~ Stable minimization Mar 10, 2022

mikemhenry mentioned this pull request Mar 10, 2022

Improve robustness against OpenMMException (e.g. CUDA_ERROR_ILLEGAL_ADDRESS) choderalab/perses#928

Open

jchodera self-requested a review March 17, 2022 17:09

jchodera self-assigned this Mar 17, 2022

jchodera added 🌠 enhancement ⬆️ high-priority labels Mar 17, 2022

jchodera changed the title ~~Stable minimization~~ Enhance stability of minimization Mar 17, 2022

ijpulidos self-requested a review April 4, 2022 20:39

ijpulidos added this to the 0.21.3 milestone Apr 4, 2022

jchodera reviewed Apr 4, 2022

View reviewed changes

jchodera removed this from the 0.21.3 milestone Apr 4, 2022

ijpulidos requested a review from jchodera June 1, 2022 22:40

ijpulidos added this to the 0.22.0 milestone Jun 3, 2022

ijpulidos mentioned this pull request Jan 10, 2023

Eliminate FireMinimizer in multistate sampler. #645

Open

ijpulidos mentioned this pull request Mar 21, 2023

Use LocalEnergyMinimizer instead of FireMinimizer #668

Open

ijpulidos mentioned this pull request Mar 28, 2023

Replacing Fire minimization with LocalEnergyMinimizer #672

Open

5 tasks

jfennick added 5 commits April 4, 2023 16:37

minimize at NVT

f8dd5c7

add Gradient Descent minimization before FIRE

246469e

replace FIRE minimizer with L-BFGS

b928b56

added platform properties

704e4ec

decreased initial step size

45dc0ee

ijpulidos removed this from the 0.22.0 milestone Apr 6, 2023

ijpulidos added this to the 0.22.1 milestone Apr 6, 2023

ijpulidos removed this from the 0.22.1 milestone May 31, 2023

jfennick closed this by deleting the head repository Aug 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance stability of minimization #557

Enhance stability of minimization #557

jfennick commented Mar 9, 2022

jfennick commented Mar 9, 2022

mikemhenry commented Mar 9, 2022

jfennick commented Mar 9, 2022

jchodera left a comment

jchodera Apr 4, 2022

jfennick May 27, 2022

jchodera Apr 4, 2022

jfennick May 27, 2022

jfennick commented May 27, 2022

jfennick commented Jun 3, 2022

ijpulidos commented Jun 3, 2022

ijpulidos commented Mar 28, 2023

Enhance stability of minimization #557

Enhance stability of minimization #557

Conversation

jfennick commented Mar 9, 2022

Description

Todos

Status

jfennick commented Mar 9, 2022

mikemhenry commented Mar 9, 2022

jfennick commented Mar 9, 2022

jchodera left a comment

Choose a reason for hiding this comment

jchodera Apr 4, 2022

Choose a reason for hiding this comment

jfennick May 27, 2022

Choose a reason for hiding this comment

jchodera Apr 4, 2022

Choose a reason for hiding this comment

jfennick May 27, 2022

Choose a reason for hiding this comment

jfennick commented May 27, 2022

jfennick commented Jun 3, 2022

ijpulidos commented Jun 3, 2022

ijpulidos commented Mar 28, 2023