
Added RMSProp Optimizer subroutine #144

Merged: 4 commits into modern-fortran:main on Jun 20, 2023
Conversation

@Spnetic-5 (Collaborator) commented Jun 16, 2023

Solves #136
This pull request adds an implementation of the RMSprop optimizer subroutine to the existing quadratic example.

Approach:

  • Initialized rms_weights and rms_gradients arrays of appropriate dimensions.
  • Added a nested loop over the network layers to update the weights using the RMSprop update rule.
  • Calculated rms_weights and rms_gradients using the decay rate and current weights/gradients.
  • Updated the weights using the RMSprop rule: weights = weights - (learning_rate / sqrt(rms_weights + epsilon)) * gradients (see the sketch after this list).
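
For reference, here is a minimal, self-contained sketch of the textbook RMSprop rule this list refers to; note that standard RMSprop accumulates a running average of the squared gradients (called rms_gradients below), and the names and values are illustrative rather than the exact code in this PR:

```fortran
program rmsprop_rule_sketch
  implicit none
  integer, parameter :: n = 3
  real, parameter :: learning_rate = 0.01, decay_rate = 0.9, epsilon = 1e-8
  real :: weights(n), gradients(n), rms_gradients(n)

  weights = [1.0, 2.0, 3.0]
  gradients = [0.1, -0.2, 0.3]  ! stand-in gradients for illustration
  rms_gradients = 0

  ! Decaying moving average of the squared gradients
  rms_gradients = decay_rate * rms_gradients + (1 - decay_rate) * gradients**2

  ! Scale each weight's step by the inverse RMS of its recent gradients
  weights = weights - learning_rate / sqrt(rms_gradients + epsilon) * gradients

  print *, weights
end program rmsprop_rule_sketch
```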

@Spnetic-5 requested a review from milancurcic on June 16, 2023, 14:14
@Spnetic-5 requested a review from milancurcic on June 18, 2023, 22:12
@Spnetic-5 (Collaborator, Author)

Apologies for the late reply, @milancurcic.

@milancurcic (Member)

@Spnetic-5 I mostly rewrote the subroutine so that it now compiles and converges. It doesn't use mini-batching; for simplicity, for now the update is applied once after the entire batch of forward and backward passes.
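
To make that placement concrete, here is a small self-contained sketch (not the actual example code; the gradient values are placeholders) showing gradients accumulated over the whole batch first, followed by a single RMSprop update per epoch:

```fortran
program epochwise_rmsprop_sketch
  implicit none
  integer, parameter :: n = 2, batch_size = 4, num_epochs = 100
  real, parameter :: lr = 0.01, decay = 0.9, eps = 1e-8
  real :: w(n), g(n), rms_g(n)
  integer :: epoch, i

  w = [0.5, -0.5]
  rms_g = 0

  do epoch = 1, num_epochs
    ! Forward and backward passes over the entire batch first,
    ! accumulating the gradient (placeholder values here) ...
    g = 0
    do i = 1, batch_size
      g = g + [0.1, -0.2] / batch_size  ! stand-in per-sample gradient
    end do
    ! ... then a single RMSprop step per epoch, i.e. no mini-batching.
    rms_g = decay * rms_g + (1 - decay) * g**2
    w = w - lr / sqrt(rms_g + eps) * g
  end do

  print *, w
end program epochwise_rmsprop_sketch
```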

I understand that this PR was challenging; it took me a bit to find the right approach. In your most recent commit, you made some changes and wrote "made suggested corrections", which made it sound like the PR was good to go. However, the example was not even compiling at that stage. Whenever you struggle with an implementation, please write a comment in the PR explaining where you got stuck and whether you need help, rather than just leaving a short commit message.

Also, please study the implementation in this PR. It introduces a new derived type that tracks a moving average of gradients across epochs, for each layer. We are likely to use this approach for other optimizers that need moving-average logic.
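
A hedged sketch of that idea, assuming a simple derived type that stores the running average of (squared) gradients per layer; the actual type name and components in the PR may differ:

```fortran
module rmsprop_state_m
  implicit none

  ! Holds the moving average of squared gradients for one layer,
  ! so the average persists across epochs.
  type :: rmsprop_state
    real, allocatable :: rms_gradient(:)
  end type rmsprop_state

end module rmsprop_state_m
```

One such element would be kept per network layer, e.g. `type(rmsprop_state), allocatable :: state(:)`, and updated on every epoch.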

@milancurcic merged commit 44833c2 into modern-fortran:main on Jun 20, 2023
@Spnetic-5 (Collaborator, Author) commented Jun 20, 2023

I apologize for the confusion caused by my commit message; it was not my intention to imply that the code was ready to go. The code was compiling and running well on my PC. I'll make sure to provide detailed comments in the pull request in the future.

Thanks for the changes, I'll study those.

@milancurcic (Member)

Thank you, @Spnetic-5, and no worries. I apologize for jumping the gun and finishing the implementation in this PR.

Going forward, would you like to take a shot at continuing the work in #139, or would you like to implement another optimizer in the quadratic fit example program? Recall that once we implement #139 for SGD, the new optimizers in quadratic will serve as prototype implementations to be ported into the library.

@Spnetic-5 (Collaborator, Author)

Thank you, @milancurcic. I would like to work on #137 first. Once we have completed that, we can move on to #139 and then additional new optimizers.
