Can we have LLMs loaded on multiple GPUs via the HuggingFacePipeline.from_model_id function? #4965
Unanswered · kiran-paul98 asked this question in Q&A

I am trying to load MosaicML's MPT-7B using the HuggingFacePipeline.from_model_id function and its device parameter, but the whole model does not fit on one GPU and I get an out-of-memory error. I have 2 GPUs. Is it possible to load the model across multiple GPUs? I know that the device parameter currently takes only a single integer. Is there a workaround to pass multiple GPU devices to the function so that the model can be loaded?
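One common workaround is to skip from_model_id entirely: build the transformers pipeline yourself with accelerate's device_map="auto", which shards the weights across every visible GPU, and hand the finished pipeline to HuggingFacePipeline. A minimal sketch, assuming transformers and accelerate are installed and that MPT-7B pairs with the EleutherAI/gpt-neox-20b tokenizer:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline
from langchain.llms import HuggingFacePipeline

model_id = "mosaicml/mpt-7b"

# device_map="auto" lets accelerate split the layers across all visible GPUs
# (and spill to CPU if they still don't fit); half precision cuts memory further.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,  # MPT ships custom modeling code
)

# Assumption: MPT-7B uses the gpt-neox-20b tokenizer.
tokenizer = AutoTokenizer.from_pretrained("EleutherAI/gpt-neox-20b")

pipe = pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
    max_new_tokens=128,
)

# Wrap the ready-made pipeline instead of calling from_model_id,
# whose `device` argument is limited to a single integer.
llm = HuggingFacePipeline(pipeline=pipe)
print(llm("Explain pipeline parallelism in one sentence."))
```

Note that no device is passed to pipeline() here: once device_map places the shards, accelerate handles routing between the GPUs.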
Replies: 1 comment
- Is there anything to be gained in ingest, embeddings, or chatting from just one GPU? Are the instructions to get one GPU utilized pretty simple, e.g. something like the sketch below? Am I missing anything?
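Getting one GPU utilized is indeed simple. A minimal sketch (gpt2 is just a stand-in model, following the from_model_id docstring example), where device=0 pins the pipeline to the first GPU and device=-1 falls back to CPU:

```python
from langchain.llms import HuggingFacePipeline

# device is the CUDA device index: 0 = first GPU, 1 = second, -1 = CPU.
llm = HuggingFacePipeline.from_model_id(
    model_id="gpt2",  # stand-in; any causal LM that fits on one card works
    task="text-generation",
    device=0,
    model_kwargs={"temperature": 0, "max_length": 64},
)
print(llm("Say hello to"))
```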