-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
“No backend type associated with device type cpu” when run cli_demo_sat.py #173
Comments
如果想用cpu运行,请确保CUDA_VISIBLE_DEVICES=空 |
是想用GPU运行的,但是有quant 8,所以AutoModel.from_pretrained()一开始是在CPU上吧 |
quant 8暂时不支持overwrite_arge={'model_parallel_size'} |
那难道是我记错了。。。所以目前quant是不支持多卡推理的是吧 另外我改成bf16报错
|
是的,因为quant切分的状态我也不知道怎么均分到不同卡上……取决于量化算法 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
原来能跑起来的,现在又不行了,是sat又更新了吗?
目前版本torch=2.1.2,sat=0.4.11,transformers=4.38.2
The text was updated successfully, but these errors were encountered: