SGD optimizer stub #139

milancurcic · 2023-06-13T16:53:29Z

First attempt at defining the concrete optimizer procedure as a method of SGD optimizer type.

Currently defining the minimize subroutine as elemental to allow a scalar/array/rank-agnostic interface. It's possible that this won't work for all cases if we discover new requirements but let's try this for the time being.

…k % update()

milancurcic · 2023-06-21T21:01:23Z

This now works. There's an API change to the network % train() and network % update() methods which now require an argument of class(optimizer_base_type). (I wonder if it's possible to make this optional so we can default to sgd).

Once an optimizer is passed to network % update, it's passed to layer % update() for all layers. In layer % update, the weights and biases are accessed from the internal layer representation and passed to optimizer % minimize(). I borrowed the name minimize from Keras. optimizer % optimize() would be appropriate but sounds weird due to repetition. How about optimizer % update()?

The optimizer step for conv2d is currently not implemented but it may be easy to do so even in this PR (but convolutional training is broken anyway as explained in #142).

Spnetic-5 · 2023-06-22T01:48:22Z

Thanks for bringing up the API change regarding the network methods. Making the optimizer argument optional and setting up SGD as default sounds like a good idea.

Regarding the naming, I think optimizer % minimize() is good, as it captures the essence of the operation. Furthermore, I'll study all the updations in the code.

milancurcic added 3 commits June 13, 2023 12:48

Defining the SGD minimization step in the optimizer type

677dc5e

Add note about refactor needed

75a3f9c

Pass optimizer instance down to layer % update()

cf904d7

milancurcic mentioned this pull request Jun 20, 2023

Added RMSProp Optimizer subroutine #144

Merged

milancurcic added 3 commits June 21, 2023 12:38

Merge branch 'main' into sgd-optimizer-stub

523fbc2

Apply the optimizer update step in layer % update

23add45

Changes in tests and examples to account for the API change in networ…

84e5a17

…k % update()

milancurcic marked this pull request as ready for review June 21, 2023 20:55

milancurcic added the enhancement New feature or request label Jun 21, 2023

milancurcic requested a review from Spnetic-5 June 21, 2023 20:56

milancurcic added 2 commits June 22, 2023 10:58

Make optimizer optional; default to SGD with learning rate of 1

74636f9

Apply optimizer to conv2d layer

7e36eb2

milancurcic merged commit 31fc061 into modern-fortran:main Jun 22, 2023

milancurcic deleted the sgd-optimizer-stub branch June 22, 2023 15:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SGD optimizer stub #139

SGD optimizer stub #139

milancurcic commented Jun 13, 2023

milancurcic commented Jun 21, 2023

Spnetic-5 commented Jun 22, 2023 •

edited

Loading

SGD optimizer stub #139

SGD optimizer stub #139

Conversation

milancurcic commented Jun 13, 2023

milancurcic commented Jun 21, 2023

Spnetic-5 commented Jun 22, 2023 • edited Loading

Spnetic-5 commented Jun 22, 2023 •

edited

Loading