
Connect flatten, conv2d, and maxpool2d layers in backward pass #142

Merged — 3 commits, Jun 22, 2023

Conversation

milancurcic
Member

cnn_mnist was previously converging to a low ~93% accuracy solution because the convolutional layers were disconnected in the backward pass, so only the output dense layer was being trained. This PR is a WIP that connects these layers. Now that they're connected, cnn_mnist doesn't converge, due to not-yet-uncovered issues in the backward passes of the conv2d and possibly maxpool2d layers as well.
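To illustrate why a disconnected backward pass trains only the last layer: each layer's backward method must return the gradient with respect to its input, and that gradient must be fed to the preceding layer. Here is a minimal Python/numpy sketch of that chaining (the class and method names are hypothetical, not neural-fortran's actual Fortran API):

```python
import numpy as np

class Flatten:
    """Flattens (batch, h, w) to (batch, h*w); backward is the inverse reshape."""
    def forward(self, x):
        self.shape = x.shape
        return x.reshape(x.shape[0], -1)
    def backward(self, grad):
        return grad.reshape(self.shape)

class Dense:
    """A bias-free dense layer: y = x @ w."""
    def __init__(self, n_in, n_out):
        rng = np.random.default_rng(0)
        self.w = rng.standard_normal((n_in, n_out)) * 0.1
    def forward(self, x):
        self.x = x
        return x @ self.w
    def backward(self, grad):
        self.dw = self.x.T @ grad  # parameter gradient (used by the optimizer)
        return grad @ self.w.T     # gradient passed upstream to the previous layer

layers = [Flatten(), Dense(4, 2)]
x = np.ones((1, 2, 2))
out = x
for layer in layers:
    out = layer.forward(out)

# The fix amounts to this chaining: each layer's backward receives the
# gradient returned by the layer after it. If the chain is broken before
# the convolutional layers, their parameters never receive gradients.
grad = np.ones_like(out)
for layer in reversed(layers):
    grad = layer.backward(grad)

# The gradient arriving at the input has the input's shape, confirming
# that every layer in between participated in the backward pass.
```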

@milancurcic milancurcic added the bug Something isn't working label Jun 15, 2023
@milancurcic milancurcic self-assigned this Jun 15, 2023
@milancurcic milancurcic mentioned this pull request Jun 21, 2023
@milancurcic milancurcic merged commit 6bbc28d into modern-fortran:main Jun 22, 2023
@milancurcic milancurcic deleted the fix-conv2d-backprop branch June 22, 2023 15:27
@certik
Contributor

certik commented Mar 21, 2024

Git bisect revealed that this PR dropped the training accuracy from >90% to ~10% (#145 (comment)). However, based on this PR's description, it seems the PR fixed some issues while others remain to be fixed.

Was there ever a time when the training fully worked?

@milancurcic
Member Author

Good question. There was a time when I thought it was converging (although not at the expected level of ~96% accuracy, but rather in the mid-80s, IIRC) because I wrote a poor test. However, I think 2-d CNN training never worked correctly; it's implemented and likely a bug or two away from working, but I haven't made fixing it a priority yet. We know that inference works because we can load a pre-trained Keras CNN and infer with high accuracy. So the bug(s) are somewhere in the backward pass of one or more of the conv2d, maxpool2d, flatten, and/or reshape layers.
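Of the suspect layers, maxpool2d is a common source of backward-pass bugs because its gradient must be routed only to the position that held the maximum in the forward pass. A hedged numpy sketch of that rule for a 2x2 pool (an illustration of the standard technique, not the Fortran implementation under suspicion here):

```python
import numpy as np

def maxpool2d_forward(x, k=2):
    """Non-overlapping k-by-k max pooling; also records argmax locations."""
    h, w = x.shape
    out = np.zeros((h // k, w // k))
    argmax = np.zeros((h // k, w // k, 2), dtype=int)
    for i in range(h // k):
        for j in range(w // k):
            window = x[i*k:(i+1)*k, j*k:(j+1)*k]
            idx = np.unravel_index(np.argmax(window), window.shape)
            argmax[i, j] = (i*k + idx[0], j*k + idx[1])
            out[i, j] = window[idx]
    return out, argmax

def maxpool2d_backward(grad, argmax, shape):
    """Routes each output gradient to the input cell that was the max."""
    dx = np.zeros(shape)
    for i in range(grad.shape[0]):
        for j in range(grad.shape[1]):
            r, c = argmax[i, j]
            dx[r, c] = grad[i, j]
    return dx

x = np.array([[1., 2.],
              [4., 3.]])
out, argmax = maxpool2d_forward(x)
dx = maxpool2d_backward(np.ones_like(out), argmax, x.shape)
# Only the max location (the 4 at row 1, column 0) receives gradient;
# the other three cells get zero.
```

A backward pass that smears the gradient over the whole window, or records the wrong argmax, would still produce correctly shaped gradients and so would not fail a shape check, which is one reason such bugs survive until a convergence test catches them.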
