Feature/grads #4

basavyr · 2024-10-27T10:46:51Z

This PR adds gradient visualization for CNN training. Gradients are collected automatically from every layer during the backward pass and plotted as histograms of their distributions. The collected gradients are kept in an efficient data structure for the whole training run, so they can be inspected both in real time and after training has finished.
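To make the collect-then-histogram idea concrete, here is a minimal, framework-free sketch. It is not the code from this PR: `GradStore`, `record`, `epochs`, and `histogram` are hypothetical names chosen for illustration, and gradients are assumed to arrive as flat lists of floats per layer.

```python
from typing import Dict, List, Sequence


class GradStore:
    """Hypothetical container: layer name -> one flat list of gradient values per epoch."""

    def __init__(self) -> None:
        self._grads: Dict[str, List[List[float]]] = {}

    def record(self, layer: str, values: Sequence[float]) -> None:
        # Append one epoch's worth of gradient values for this layer.
        self._grads.setdefault(layer, []).append([float(v) for v in values])

    def epochs(self, layer: str) -> List[List[float]]:
        # Return the recorded epochs for a layer (empty list if none were recorded).
        return self._grads.get(layer, [])


def histogram(values: Sequence[float], bins: int, lo: float, hi: float) -> List[int]:
    """Count values into `bins` equal-width buckets over [lo, hi].

    Values outside the range are clamped into the first or last bucket.
    """
    if bins <= 0 or hi <= lo:
        raise ValueError("need bins > 0 and hi > lo")
    counts = [0] * bins
    width = (hi - lo) / bins
    for v in values:
        idx = int((v - lo) / width)
        idx = min(max(idx, 0), bins - 1)
        counts[idx] += 1
    return counts


store = GradStore()
store.record("fc0", [-0.5, 0.0, 0.5])   # epoch 0
store.record("fc0", [-0.1, 0.0, 0.1])   # epoch 1
per_epoch = [histogram(g, bins=4, lo=-1.0, hi=1.0) for g in store.epochs("fc0")]
```

Using fixed bin edges across epochs keeps the per-epoch histograms directly comparable, which is one possible design choice rather than a description of what the plotter in this PR does.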

As training progresses, the system generates a sequence of histograms (one per epoch) that illustrate how gradient distributions evolve over time. This visualization capability enables developers to monitor gradient flow, detect potential vanishing or exploding gradient issues, and identify problematic layers that may need attention. The feature can be enabled through a simple configuration flag and has been optimized to minimize its impact on training performance.
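As an illustration of how collected gradients could be screened for vanishing or exploding behavior, the sketch below flags layers by the L2 norm of their gradients. The function names and both thresholds (`vanish_below`, `explode_above`) are assumptions made for this example; they are not taken from this PR and would need tuning per model.

```python
import math
from typing import Dict, Sequence


def grad_norm(values: Sequence[float]) -> float:
    """L2 norm of a flat sequence of gradient values."""
    return math.sqrt(sum(v * v for v in values))


def flag_layers(
    grads: Dict[str, Sequence[float]],
    vanish_below: float = 1e-6,   # assumed threshold, not from the PR
    explode_above: float = 1e3,   # assumed threshold, not from the PR
) -> Dict[str, str]:
    """Label each layer as 'vanishing', 'exploding', or 'ok' based on its gradient norm."""
    flags: Dict[str, str] = {}
    for layer, values in grads.items():
        norm = grad_norm(values)
        if not math.isfinite(norm) or norm > explode_above:
            # NaN or infinite norms are treated as exploding.
            flags[layer] = "exploding"
        elif norm < vanish_below:
            flags[layer] = "vanishing"
        else:
            flags[layer] = "ok"
    return flags


flags = flag_layers({
    "conv0": [1e-9, -2e-9],
    "fc0": [0.1, -0.2],
    "fc1": [5e3, 1.0],
})
```

Running a check like this once per epoch alongside the histograms would point to the layers worth inspecting first.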

This addition will enhance model debugging and optimization workflows by providing clear insights into training dynamics through visual representation of gradient behavior.

basavyr added 14 commits October 13, 2024 18:26

add initial model and its config

6c0ab30

add mnist training and eval data

05ad9c8

add train and eval methods

c637634

add hash to the model config

a94ac2f

return accuracy in eval

4a403d8

save model checkpoint after training

4accba1

use SGD instead of Adam

c34dae1

add adam optimizer as optional

0d1942d

move fc0 and fc1 into _init_linear_layers

81fd958

add cifar100

f3a1045

add maxpool and batchnorm

f032c5e

use a model for every data type

92c3528

add method to collect all model grads

718d858

add grad plotter

cacc92c

basavyr added the enhancement label (New feature or request) Oct 27, 2024

basavyr self-assigned this Oct 27, 2024
