Normalizing the input features can speed up the learning. Normalize t
Something that is more elongated to something that is more normal. Activations A1 A2 etc. Couldn't we normalize the mean and the variance help train more efficiently?
Can we normalize the values of A to train W b fast?
This is what batch normalization does
Normalizing
Given some intermediate values in NN
Compute the mean as follows
and
These 4 equations are calculating the identify function. Where previously you were using
We use
Normalizing the input features can help learning in a neural network. Batch norm applies that same normlization process not just to the input layer, but to values deep within the hidden layer. Depending on how you have made your NN, you may not want your hidden values to be of