Add cfg option --scheduler.warmup_min_lr
#542
Conversation
For setting the starting LR during the warmup period.
Thanks for doing this!
lgtm
LGTM! Just minor comments
olmo/optim.py (Outdated)

@@ -480,7 +481,11 @@ def get_max_grad_norm_ratio(
         return self._get_max_grad_norm_coeff(initial_max_grad_norm_ratio, step, max_steps)

     def _linear_warmup(self, initial_lr: float, step: int, warmup_steps: int = 2000) -> float:
-        return initial_lr * (0.1 + 0.9 * min(step, warmup_steps) / warmup_steps)
+        if self.warmup_min_lr is not None:
+            assert initial_lr > self.warmup_min_lr
nit: You could also add an assert self.warmup_min_lr >= 0 for good measure.
Done: 75eba56
olmo/optim.py (Outdated)

+        if self.warmup_min_lr is not None:
+            assert initial_lr > self.warmup_min_lr
+            return self.warmup_min_lr + (initial_lr - self.warmup_min_lr) * min(step, warmup_steps) / warmup_steps
+        else:
From a cleaner code perspective, maybe it would be better to have just one formula and use self.warmup_min_lr to set the starting LR. Something like:

    warmup_min_lr = self.warmup_min_lr if self.warmup_min_lr is not None else 0.1 * initial_lr
    ...
    return warmup_min_lr + (initial_lr - warmup_min_lr) * min(step, warmup_steps) / warmup_steps
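The unified formula suggested above can be sketched as a standalone function. This is illustrative only, not the exact code merged in the PR: the real version is a method on the scheduler class in olmo/optim.py, while here warmup_min_lr is passed as a parameter.

```python
from typing import Optional


def linear_warmup(
    initial_lr: float,
    step: int,
    warmup_steps: int = 2000,
    warmup_min_lr: Optional[float] = None,
) -> float:
    # Default to the pre-PR behavior: start warmup at 10% of the target LR.
    start_lr = warmup_min_lr if warmup_min_lr is not None else 0.1 * initial_lr
    assert 0 <= start_lr < initial_lr
    # Interpolate linearly from start_lr up to initial_lr over warmup_steps,
    # then hold at initial_lr once warmup is complete.
    return start_lr + (initial_lr - start_lr) * min(step, warmup_steps) / warmup_steps
```

With warmup_min_lr left unset, step 0 gives 0.1 * initial_lr and the LR reaches initial_lr at warmup_steps, matching the old 0.1 + 0.9 * fraction form; setting warmup_min_lr=0.0 starts warmup from zero.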
Done: 75eba56
For setting the starting LR during the warmup period. When not set, this defaults to the current behavior of starting at 10% of the target LR.