Currently, when exporting a quantized model, LLaMA Factory requires `auto_gptq` to be installed.
As per the project's page:
> 🚨 AutoGPTQ development has stopped. Please switch to GPTQModel as drop-in replacement. 🚨

The problem this causes for me is that I can't build AutoGPTQ against the latest PyTorch in a CUDA 12.8 environment, as the build fails with a compilation error. GPTQModel, on the other hand, builds fine in that environment. Also, LLaMA Factory seems to import `auto_gptq` regardless of which quantization method is chosen in the config.
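As a rough illustration of what I'd hope for, here is a minimal sketch of an import check that prefers `gptqmodel` and only falls back to the deprecated `auto_gptq`, and that would only need to run when a GPTQ export is actually requested. The helper name here is hypothetical, not LLaMA Factory's actual code; only the two package names and their top-level classes are real.

```python
# Hypothetical sketch: prefer gptqmodel, fall back to the deprecated auto_gptq,
# instead of unconditionally importing auto_gptq at export time.
import importlib.util


def get_gptq_backend() -> str:
    """Return the first available GPTQ backend, preferring gptqmodel."""
    for backend in ("gptqmodel", "auto_gptq"):
        if importlib.util.find_spec(backend) is not None:
            return backend
    raise ImportError(
        "GPTQ export requires `gptqmodel` (preferred) or the deprecated "
        "`auto_gptq` package to be installed."
    )


backend = get_gptq_backend()
if backend == "gptqmodel":
    from gptqmodel import GPTQModel  # noqa: F401
else:
    from auto_gptq import AutoGPTQForCausalLM  # noqa: F401
```

Something along these lines would let environments where only GPTQModel builds (e.g. CUDA 12.8 with recent PyTorch) still export quantized models, and would avoid pulling in `auto_gptq` for non-GPTQ quantization methods.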