You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I need to train a simple GCN on the ogbg-molbace dataset, aiming for reproducible results. I've made slight modifications to your main_pyg.py script (from this source) and manually set a seed before defining the model. However, I've noticed that running the script multiple times yields different performances. For clarity, I'm including the performances of a GCN on ogbg-molhiv, as the leaderboard only reports GCN performances on this dataset. Across 10 runs, the results show a ~1% of std, which should be weird considering that the models have the same weights initialization.
Since I'm using the default hyperparameters, I think the issue may be related to model convergence. Could you please provide the hyperparameters used to train a GCN? The default model in the script is GIN, so I assume the provided hyperparameters are intended for it.
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [32:19<00:00, 19.39s/it]
Finished training!
Best epoch: 38
Best validation score: 0.8119886586321772
Test score: 0.7545201722706116
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [32:15<00:00, 19.35s/it]
Finished training!
Best epoch: 51
Best validation score: 0.8196250979815796
Test score: 0.7756580853241661
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [32:24<00:00, 19.45s/it]
Finished training!
Best epoch: 49
Best validation score: 0.8157303301979227
Test score: 0.7535989493810231
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [32:42<00:00, 19.63s/it]
Finished training!
Best epoch: 73
Best validation score: 0.8068874926513814
Test score: 0.7652619787944919
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [32:27<00:00, 19.48s/it]
Finished training!
Best epoch: 51
Best validation score: 0.8044425460513424
Test score: 0.7589679213580796
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [32:17<00:00, 19.37s/it]
Finished training!
Best epoch: 28
Best validation score: 0.819634283754654
Test score: 0.7728866915158655
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [31:45<00:00, 19.06s/it]
Finished training!
Best epoch: 51
Best validation score: 0.8099769743288262
Test score: 0.7803182757488556
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [31:45<00:00, 19.06s/it]
Finished training!
Best epoch: 24
Best validation score: 0.8087368949637468
Test score: 0.7654087564456633
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [31:44<00:00, 19.05s/it]
Finished training!
Best epoch: 71
Best validation score: 0.8040307172251616
Test score: 0.7746557484694568
python main_pyg.py --gnn "gcn"
100%|██████████| 100/100 [31:46<00:00, 19.07s/it]
Finished training!
Best epoch: 63
Best validation score: 0.80362041936116
Test score: 0.7799976824581394
Hi,
I need to train a simple GCN on the
ogbg-molbace
dataset, aiming for reproducible results. I've made slight modifications to yourmain_pyg.py
script (from this source) and manually set a seed before defining the model. However, I've noticed that running the script multiple times yields different performances. For clarity, I'm including the performances of a GCN onogbg-molhiv
, as the leaderboard only reports GCN performances on this dataset. Across 10 runs, the results show a ~1% of std, which should be weird considering that the models have the same weights initialization.Since I'm using the default hyperparameters, I think the issue may be related to model convergence. Could you please provide the hyperparameters used to train a GCN? The default model in the script is GIN, so I assume the provided hyperparameters are intended for it.
The text was updated successfully, but these errors were encountered: