Error while loading the pre-trained models during RL training #72

bnaman50 · 2022-03-11T17:20:05Z

Hello Chen,

Thanks for providing this code. It seems really helpful for my current research.

However, I am having issues with making this code work. I have setup the environment as suggested but still able to load the pre-trained models (both extractive and abstractive models).

Extractive Model gives error in line assert ext_meta['net'] == 'ml_rnn_extractor'. Looking at the meta.json file, net:rnn-ext_abs_rl. I am not sure why is this discrepancy.
For abstractive model, I face RuntimeError: CUDNN_STATUS_EXECUTION_FAILED error in line self._net = abstractor.to(self._device). I am not sure how to solve this error. I made sure that CUDA is available. Also, it is not the OOM memory as suggested in some of the pages since the GPU memory never exceeds 1 GB.

It would be great if you could help me out.

Thanks,
Naman

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error while loading the pre-trained models during RL training #72

Error while loading the pre-trained models during RL training #72

bnaman50 commented Mar 11, 2022

Error while loading the pre-trained models during RL training #72

Error while loading the pre-trained models during RL training #72

Comments

bnaman50 commented Mar 11, 2022