Issues in preprocessing for GLN #10

Open
ZhichaoTan opened this issue Oct 28, 2024 · 1 comment

ZhichaoTan commented Oct 28, 2024

Hello, the rxn-ebm project is inspiring. However, when I tried to preprocess the USPTO-50K data for GLN by running bash scripts/gln_preprocess.sh, I ran into the following error:

Checking data format before preprocessing
Checking the first 100 entries for each file
Data format check passed
Step 4: building all reactions
loading data info from ./data/gln_schneider50k/processed/cooked_schneider50k/tpl-default
loading templates
11753it [00:00, 263949.06it/s]
# unique templates 11753
# raw train loaded: 39713
loading positive tpls
Traceback (most recent call last):
  File "/home/yuxuan/Zhichao_repo/rxn-ebm/rxnebm/proposer/gln_openretro/preprocess.py", line 97, in <module>
    preprocess_main(args)
  File "/home/yuxuan/Zhichao_repo/rxn-ebm/rxnebm/proposer/gln_openretro/preprocess.py", line 71, in preprocess_main
    processor.preprocess()
  File "/home/yuxuan/Zhichao_repo/rxn-ebm/rxnebm/proposer/gln_openretro/models/gln_model/gln_processor.py", line 129, in preprocess
    self.build_all_reactions()                  # step 4
    ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/yuxuan/Zhichao_repo/rxn-ebm/rxnebm/proposer/gln_openretro/models/gln_model/gln_processor.py", line 432, in build_all_reactions
    DataInfo.load_cooked_part("train", part=pid, load_graphs=False)
  File "/home/yuxuan/Zhichao_repo/rxn-ebm/rxnebm/proposer/gln_openretro/models/gln_model/gln/data_process/data_info.py", line 82, in load_cooked_part
    header = next(reader)
             ^^^^^^^^^^^^
StopIteration

connorcoley (Contributor) commented

It looks like this might originate from the openretro repo instead of the rxn-ebm repo; do you mind re-opening an issue there?
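
The last frame of the traceback above, header = next(reader), raises StopIteration when the iterator is already exhausted, for example when a csv.reader is opened on a file with no rows. The sketch below shows one way to check for that condition before preprocessing. It assumes the cooked files are read with Python's csv.reader and carry a .csv extension; the traceback suggests the former but confirms neither. The helper names read_header and find_empty_csvs are hypothetical and are not part of GLN, openretro, or rxn-ebm.

```python
import csv
import os


def read_header(path):
    """Return the first CSV row of `path`, or None if the file has no rows."""
    with open(path, newline="") as f:
        reader = csv.reader(f)
        # Passing a default to next() avoids StopIteration on an empty file.
        return next(reader, None)


def find_empty_csvs(folder):
    """Return the sorted names of .csv files in `folder` that have no rows."""
    empty = []
    for name in sorted(os.listdir(folder)):
        if not name.endswith(".csv"):
            continue  # assumption: only .csv files are of interest
        if read_header(os.path.join(folder, name)) is None:
            empty.append(name)
    return empty
```

Pointing find_empty_csvs at the directory the failing step reads from would list any files that came out empty from an earlier step, which may help narrow down where the pipeline stopped producing data.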
