Any plans for supporting Copilot+PC NPU acceleration (e.g. via ONNX) for Python? #7592
Closed
AndreasKunar started this conversation in Ideas
Replies: 1 comment 1 reply
-
Hi @AndreasKunar, thanks for your question. Yes, we have plans to support the ONNX AI connector in SK Python. We have a work item tracking this in our backlog; it should be picked up a few sprints from now.
-
I use SK on the new Copilot+ PCs with Windows on ARM (WoA). While llama.cpp runs great on the Snapdragon X CPUs (with its new 2-3x-accelerated Q4_0_4_8 quantization it is even comparable to base Apple Silicon with GPU), it does not (yet) support the Snapdragon's GPU or NPU. Ollama needs custom builds to run on WoA, and Semantic Kernel .NET already supports ONNX.
Are there any plans to support ONNX from Python as well, not just .NET, so that the arm64 Copilot+ PCs can use their NPU from Python / Semantic Kernel?
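For context, here is a minimal sketch of how one could probe from Python today whether ONNX Runtime can see an NPU-capable execution provider. The provider name "QNNExecutionProvider" (the Qualcomm NPU path, shipped in the separate onnxruntime-qnn package) is taken from the public ONNX Runtime documentation; this is independent of Semantic Kernel itself.

```python
# Sketch, not an SK API: check which ONNX Runtime execution providers are
# visible from Python. On a Copilot+ PC with the onnxruntime-qnn package
# installed, the NPU should appear as "QNNExecutionProvider" (assumption
# based on public ONNX Runtime docs).

def available_onnx_providers():
    """Return the ONNX Runtime execution providers visible to this Python
    environment, or an empty list if onnxruntime is not installed."""
    try:
        import onnxruntime as ort
    except ImportError:
        return []
    return ort.get_available_providers()

if __name__ == "__main__":
    providers = available_onnx_providers()
    if "QNNExecutionProvider" in providers:
        print("NPU path available via QNN")
    else:
        print("No NPU provider found; available:", providers)
```

A connector could use such a check to prefer the NPU provider when constructing an inference session and fall back to CPU otherwise.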