I noticed that the learning rate in the code is 0.01, while the paper reports 0.05 with Adagrad. I tried training with lr = 0.05, but the training loss doesn't decrease at all, possibly because the learning rate is too high?
Why did you set lr to 0.01? Since the learning rates differ, could there be a bug in the code?
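For context, here is a minimal sketch of how the learning rate interacts with Adagrad's per-parameter step scaling, on a toy 1-D quadratic loss (this toy setup is purely illustrative and is not the paper's model or training code):

```python
import math

def adagrad_run(lr, steps=200, eps=1e-8):
    """Run Adagrad on f(w) = (w - 3)^2 and return the final loss."""
    w, g2 = 0.0, 0.0
    for _ in range(steps):
        grad = 2.0 * (w - 3.0)            # gradient of (w - 3)^2
        g2 += grad * grad                  # accumulate squared gradients
        w -= lr * grad / (math.sqrt(g2) + eps)  # Adagrad update
    return (w - 3.0) ** 2

# On this well-conditioned toy problem both rates converge, and the
# larger rate converges faster; a real model with noisy gradients can
# instead diverge at 0.05, which would match the reported behavior.
print(adagrad_run(0.01), adagrad_run(0.05))
```

Because Adagrad divides each update by the accumulated gradient magnitude, the per-step movement is bounded by roughly `lr`, so on a stable loss surface a 5x larger rate mostly just speeds things up; if 0.05 stalls or diverges on the real model, the effective loss landscape (or a preprocessing difference from the paper) is the more likely culprit than Adagrad itself.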