-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question on Hermite Polynomials Coefficients #1
Comments
Also, when you use |
Hi @elony314 , Thanks for your questions. Coefficients of Hermite Polynomial ActivationsThe n^{th} coefficient of the hermite activation is computed by calculating the inner product between the ReLU function and the n^{th} normalized hermite polynomial. Our objective is to decompose ReLU function using Hermite Polynomial Basis. Please refer to the Section 4 of arXiv:1711.00501 for the definition of the inner product and a nice primer on Hermite functions. The foot-note on Page 5 of arXiv:1711.00501 provides the precise coefficient values for different 'n', which is what we use in the code. .item(), the gradient flow is cut offThanks for pointing this out. In our model, we set the coefficients of hermite polynomials as trainable parameters . The code that is linked in the question was not used for training but to compute the time and cost of running the script for a "single" epoch, hence, we did not require gradients to be updated. I have added a new "activations.py" file that was used for training and renamed the existing file as "activations_timing.py" to resolve confusions. This new "activations.py" file was earlier made available in all the other directories such as 1-deep_autoencoder, 2-supervised_setting and 4-WhyHermitesProvideFasterConvergence. |
We use probabilists' Hermite polynomials. You have computed physicists' Hermite polynomials. |
Thank you for this interesting paper. However, as I go through the paper and code, there is something that I am not very sure about. Based on my understanding, followings are the normalized Hermite polynomial from degree 0 to 3 (i.e. h_0 to h_3).
data:image/s3,"s3://crabby-images/22f11/22f11cf519ff02b6871d2d4f5eb78c02a33f0c56" alt="$_l_1_2_x_2_sqr"
However, after reading your code, I am a little confused. In the following section of your code, the coefficient of the hermite polynomial doesn't match.
DeepHermites/Code/3-semisupervised_setting/aws_costestimates/epoch_measurements/mnist/4hermites_v2l_1epochs_w/lib/activations.py
Lines 48 to 54 in 03b2c8a
The text was updated successfully, but these errors were encountered: