No speed-up in my implementation either #7

Open
LSC527 opened this issue May 19, 2022 · 3 comments

Comments

@LSC527

LSC527 commented May 19, 2022

I implemented this paper with torch.autograd.forward_ad. However, the forward gradient showed no speed-up compared to forward + backward.
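For reference, here is a minimal sketch of the forward-gradient estimator using torch.autograd.forward_ad. A toy linear-regression loss stands in for my actual model, so this is only an illustration of the approach, not my real training code:

```python
import torch
import torch.autograd.forward_ad as fwAD

torch.manual_seed(0)
W = torch.randn(10, 1)                        # toy parameters
x, y = torch.randn(32, 10), torch.randn(32, 1)

def loss_fn(W):
    return ((x @ W - y) ** 2).mean()

# Sample a random tangent (perturbation direction) for the parameters.
v = torch.randn_like(W)

with fwAD.dual_level():
    dual_W = fwAD.make_dual(W, v)             # attach the tangent to the primal
    loss = loss_fn(dual_W)
    jvp = fwAD.unpack_dual(loss).tangent      # directional derivative grad(L) . v

# Forward-gradient estimate from the paper: scale the direction by the scalar JVP.
grad_estimate = jvp * v
W = W - 1e-2 * grad_estimate                  # plain SGD step, no backward pass
```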

@DavideTr8
Collaborator

It would be interesting for us to see your implementation as well. If you want, you can open a PR to our repo with your code, so that we can have multiple implementations available.

@LittleWork123

I ran the code from the repository, but I couldn't replicate the results mentioned in the paper, especially regarding the CNN. I used the hyperparameter settings specified in the paper.

May I inquire if there are alternative parameter settings available?

(image attachment)

@DavideTr8
Collaborator

DavideTr8 commented Jan 20, 2024

Hi, unfortunately we weren't able to reproduce the same results either.
The hyperparameters we used are the same as reported in the paper, but we don't know if alternative hyperparameter settings are available.

We believe that the difference between our implementation and the official one is due to the fact that they did not use functorch.
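For comparison, a hypothetical sketch of a functorch-style forward gradient, assuming the same toy linear-regression loss as above (this is not our repository's exact code). functorch's jvp returns the loss and the directional derivative in a single forward pass:

```python
import torch
from functorch import jvp   # torch.func.jvp in recent PyTorch releases

W = torch.randn(10, 1)                         # toy parameters
x, y = torch.randn(32, 10), torch.randn(32, 1)

def loss_fn(W):
    return ((x @ W - y) ** 2).mean()

v = torch.randn_like(W)                        # random tangent direction

# One forward pass gives the loss and the JVP grad(L) . v.
loss, directional_deriv = jvp(loss_fn, (W,), (v,))
grad_estimate = directional_deriv * v          # forward-gradient estimate
```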
