Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU memory leak. #55

Open
epbsb opened this issue Aug 10, 2023 · 3 comments
Open

GPU memory leak. #55

epbsb opened this issue Aug 10, 2023 · 3 comments

Comments

@epbsb
Copy link

epbsb commented Aug 10, 2023

Hello,

I'm using pycave for a project where the data is unidimensional of size 8e9. The GPU options works well, and I'm splitting the data in "for loops" to do the predictions. However, as the loops goes on, it takes more and more of the GPU memory, and eventually runs out of memory. To contour this issue, I'm using torch clean cache at each interaction in addition to the garbage collector function in python, as shown in the code below, however this process is slow.

import gc
def clear_gpu_memory():
    torch.cuda.empty_cache()
    gc.collect()
    torch.cuda.empty_cache()

I've tried to use the pycave built-in function of batches as well, but it also runs in the memory issue.

Is there anything I could do to fix this?

@epbsb epbsb changed the title GPU memory overflow. GPU memory leak. Aug 10, 2023
@borchero
Copy link
Owner

I haven't seen this in the past and don't currently have a GPU available for testing, unfortunately 😕

@epbsb
Copy link
Author

epbsb commented Aug 15, 2023

I "fixed" the problem. When I install pycave it forces an old installation of PyTorch (1.12) with the "torchkit" dependency. After that, I reinstall the latest 2.01 version of PyTorch and there are no more memory leaks!

I can use the batch function normally!!

@epbsb
Copy link
Author

epbsb commented Sep 21, 2023

@borchero Since my last message I noticed something, in the poetry.lock file you have this:

[package.dependencies]
numpy = ">=1.20.0,<2.0.0"
pytorch-lightning = ">=1.8,<1.13"

Which I belive forces the install of the older version of PyTorch (1.12).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants