Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error while loading the pre-trained models during RL training #72

Open
bnaman50 opened this issue Mar 11, 2022 · 0 comments
Open

Error while loading the pre-trained models during RL training #72

bnaman50 opened this issue Mar 11, 2022 · 0 comments

Comments

@bnaman50
Copy link

Hello Chen,

Thanks for providing this code. It seems really helpful for my current research.

However, I am having issues with making this code work. I have setup the environment as suggested but still able to load the pre-trained models (both extractive and abstractive models).

  1. Extractive Model gives error in line assert ext_meta['net'] == 'ml_rnn_extractor'. Looking at the meta.json file, net:rnn-ext_abs_rl. I am not sure why is this discrepancy.
  2. For abstractive model, I face RuntimeError: CUDNN_STATUS_EXECUTION_FAILED error in line self._net = abstractor.to(self._device). I am not sure how to solve this error. I made sure that CUDA is available. Also, it is not the OOM memory as suggested in some of the pages since the GPU memory never exceeds 1 GB.

It would be great if you could help me out.

Thanks,
Naman

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant