About the implementation of .cpu() #96
Comments
Thanks! It should be similar to Lines 472 to 535 in b1a7c06
Right now it is a mess because we support quantizing the scale/zero values and support offloading them to the CPU. I think in the future we are going to remove this, which should make things much easier: #93 (comment). May I ask why you would need the |
Thanks for your work. May I ask when you expect to implement the .cpu() method of HQQLinear? Or could you briefly describe how to implement it, so that I can implement it myself and submit a PR:
hqq/hqq/core/quantize.py
Line 563 in b1a7c06
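For readers wondering what a .cpu() would roughly have to do: the reply in this thread notes that the scale/zero values can themselves be quantized and offloaded, so the layer's tensors may sit in nested metadata rather than in one flat attribute. Below is a minimal sketch of the general idea only, not the hqq implementation. It assumes the state is held in nested dicts, lists, or tuples whose leaves expose a .to(device) method, as torch.Tensor does. The names move_nested and StandInTensor, and the keys in the example state, are hypothetical and used purely for illustration.

```python
from typing import Any


def move_nested(obj: Any, device: str) -> Any:
    """Return a copy of obj with every tensor-like leaf moved to device.

    A leaf counts as tensor-like if it has a callable .to attribute
    (true of torch.Tensor). Dicts, lists and plain tuples are walked
    recursively; anything else (ints, strings, None) is returned as-is.
    """
    if isinstance(obj, dict):
        return {k: move_nested(v, device) for k, v in obj.items()}
    if isinstance(obj, (list, tuple)):
        return type(obj)(move_nested(v, device) for v in obj)
    if callable(getattr(obj, "to", None)):
        return obj.to(device)
    return obj


class StandInTensor:
    """Hypothetical stand-in for torch.Tensor so the sketch runs without torch."""

    def __init__(self, device: str = "cuda") -> None:
        self.device = device

    def to(self, device: str) -> "StandInTensor":
        # Like torch.Tensor.to, return a new object rather than mutate in place.
        return StandInTensor(device)


# Illustrative layer state: quantized weights plus nested metadata (assumed layout).
state = {
    "W_q": StandInTensor(),
    "meta": {
        "scale": StandInTensor(),
        "zero": StandInTensor(),
        "nbits": 4,
        "shape": (4096, 4096),
    },
}
moved = move_nested(state, "cpu")
```

With real tensors, a .cpu() method could call such a helper on each piece of state and reassign the results. Whether that is sufficient for HQQLinear depends on details not shown in this thread.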