I tried this and similar things, but found nothing that worked. I'm not very knowledgeable about this area in general, though, so I welcome new ideas.
Here is an example of memory that is allocated during the backward pass but never released. Some memory is added on every batch: usually ~100 MiB, except for the first pass, where it's closer to ~300 MiB, as shown here.
I did find that wrapping the prediction tasks in `with torch.no_grad():` helped; some memory was never released there either. I was planning on committing that, but I'm not sure what to do for custom pipelines that might not be PyTorch.
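For reference, this is roughly what that change looks like. The model and inputs below are stand-ins, not the project's actual pipeline; the point is that `torch.no_grad()` stops autograd from building a graph during pure prediction, so the activations it would otherwise retain are never allocated.

```python
import torch

# Hypothetical stand-ins for the real model and batch.
model = torch.nn.Linear(10, 2)
x = torch.randn(4, 10)

# Inside no_grad, the forward pass records no autograd graph, so no
# intermediate activations are kept alive for a backward pass that
# will never happen.
with torch.no_grad():
    preds = model(x)

# The output carries no graph reference.
assert preds.requires_grad is False
```

`torch.inference_mode()` is a slightly stricter alternative in recent PyTorch versions, but `no_grad` is the more portable choice if older versions must be supported.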
When profiling the memory, I noticed that each backward pass increases the memory used, and that memory is never released.
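I don't know yet what's responsible here, but for context, one common pattern that produces exactly this symptom is holding a reference to the loss tensor across batches (e.g. for logging), which keeps each batch's autograd graph alive. The code below is an illustrative sketch, not this project's code:

```python
import torch

# Hypothetical stand-in model.
model = torch.nn.Linear(10, 1)

losses = []
for _ in range(3):
    x = torch.randn(8, 10)
    loss = model(x).pow(2).mean()
    loss.backward()
    # Appending `loss` itself would keep a reference to the whole
    # autograd graph of every batch, so memory grows each iteration.
    # Using .item() (or .detach()) breaks that reference.
    losses.append(loss.item())
```

If something similar is happening internally, memory would grow by roughly one batch's worth of graph per iteration, which matches the per-batch increase described above.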