
GPU offloading proper number #252

Open
alexjp opened this issue Mar 4, 2025 · 1 comment
alexjp commented Mar 4, 2025

Hi,

I have a model that I usually have to run in llama-cpp directly, because llama-cpp lets me set the exact number of layers to offload to the GPU.

For example, with the model in the image below, LM Studio's configuration lets me select a GPU offload value from 0 to 40.
If I select 40, the logs show that it offloaded 41/41 layers to the GPU.
If I select 39, the logs show that it offloaded 39/41 layers to the GPU.

How do I offload 40/41 layers to the GPU? I can do this in llama-cpp directly.
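For reference, this is what the equivalent invocation looks like in llama.cpp, using its `-ngl` / `--n-gpu-layers` flag (the model path and prompt below are placeholders, not from this issue):

```shell
# llama.cpp lets you pass the exact layer count to offload via -ngl
# (alias for --n-gpu-layers). The model path here is a placeholder.
./llama-cli -m ./model.gguf -ngl 40 -p "Hello"
# The load log then reports a line like: "offloaded 40/41 layers to GPU"
```

The ask is for LM Studio's slider (or a text field next to it) to map one-to-one onto this layer count, so that selecting 40 offloads exactly 40 of the 41 layers.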

[Image: LM Studio model configuration showing the GPU offload selector]

Thanks for LM Studio!

yagil (Member) commented Mar 4, 2025

Good question. cc @mattjcly
