-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Misc]: issue with loading weights from safetensors files #211
Comments
This is the minimal reproducer code of the torch.device context + safetensors.safe_open runtime error, just in case if you don't have access to the JIRA ticket. |
Hi, we are testing potential fix for that, we will update soon |
@michalkuligowski |
Hi, the fix will be provided in v1.19 |
Hi, this should be working with latest release. Please reopen if issue reoccurs |
Anything you want to discuss about vllm.
While implementing disaggregated prefill, we found an error regarding loading weights from safetensors files. We have filed a JIRA ticket(HS-3164) as we believe this is a synapseAI bug.
However, we found out that the code in vllm-fork is currently doing the same thing: loading safetensors file under torch.device(“hpu”) context, without involving any significant errors.
We’ll be very glad to know what made this possible, please let us know if we are missing something.
FYI, we are using IDC node with synapseAI version 1.17.0.
The text was updated successfully, but these errors were encountered: