
chapter 6 - OutOfMemoryError: CUDA out of memory. Tried to allocate 96.00 MiB. GPU #7

Open
amscosta opened this issue Sep 20, 2024 · 1 comment


@amscosta

Hello,
At the very beginning of chapter 6, when trying to run the Jupyter notebook locally on my GPU card with 8 GB of VRAM, this code:

```python
# Load model and tokenizer
model = AutoModelForCausalLM.from_pretrained(
    "microsoft/Phi-3-mini-4k-instruct",
    device_map="cuda",
    torch_dtype="auto",
    trust_remote_code=True,
)
```

results in the message: `OutOfMemoryError: CUDA out of memory. Tried to allocate...`
Any workaround is very welcome, for instance a smaller model with comparable results?
Thanks.
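A quick back-of-envelope calculation shows why an 8 GB card runs out of memory here. Phi-3-mini has roughly 3.8B parameters, and with `torch_dtype="auto"` the weights load in 16-bit precision (2 bytes per parameter), so the weights alone nearly fill the card before any KV cache or activations are allocated. The helper below is purely illustrative (not part of the book's code):

```python
# Hypothetical helper: rough VRAM needed just to hold model weights.
def weight_memory_gib(n_params: float, bytes_per_param: float) -> float:
    """Approximate GiB required to store the weights alone."""
    return n_params * bytes_per_param / 2**30

PHI3_MINI_PARAMS = 3.8e9  # Phi-3-mini is ~3.8B parameters

# 16-bit (bf16/fp16) vs. 4-bit quantized weights
fp16_gib = weight_memory_gib(PHI3_MINI_PARAMS, 2)    # ~7.1 GiB
int4_gib = weight_memory_gib(PHI3_MINI_PARAMS, 0.5)  # ~1.8 GiB

print(f"16-bit weights: {fp16_gib:.1f} GiB")
print(f"4-bit weights:  {int4_gib:.1f} GiB")
```

So at 16-bit precision the weights alone take about 7.1 GiB, which leaves essentially no headroom on an 8 GB card; 4-bit quantization or a smaller model brings this well within budget.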

@jalammar
Contributor

Might wanna try a smaller model like Gemma 2B
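If you want to keep Phi-3-mini itself, another common workaround is loading it in 4-bit via `BitsAndBytesConfig`. This is a sketch, assuming the `bitsandbytes` package is installed and a CUDA GPU is available; it is not taken from the book's notebook:

```python
# Sketch: load Phi-3-mini with 4-bit quantization to fit in ~8 GB of VRAM.
# Assumes `pip install bitsandbytes` and a CUDA-capable GPU.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute in bf16, store weights in 4-bit
)

model = AutoModelForCausalLM.from_pretrained(
    "microsoft/Phi-3-mini-4k-instruct",
    device_map="auto",          # let accelerate place layers automatically
    quantization_config=quant_config,
    trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained("microsoft/Phi-3-mini-4k-instruct")
```

Alternatively, swapping in a smaller checkpoint such as `google/gemma-2b-it` with the original loading code should also fit comfortably in 8 GB.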
