
onnxruntime-gpu: Reduce overall project size by leveraging ONNX CPU and GPU runtimes #71

Open
arky opened this issue Feb 20, 2025 · 2 comments

Comments

arky commented Feb 20, 2025

The current deployment of AddaxAI requires a 4 GB install file and 20 GB of free disk space. Most of that space is taken up by the bundled Python environments.

Python projects now leverage the onnxruntime packages (https://pypi.org/project/onnxruntime-gpu/) to run models directly. This issue is filed to explore whether switching to these runtimes could reduce the footprint of AddaxAI.
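
For illustration, here is a minimal sketch of what inference through ONNX Runtime could look like. The model file name, input name, and image shape are placeholder assumptions, not AddaxAI specifics:

```python
# Minimal sketch of ONNX Runtime inference (assumed model file and input shape,
# not AddaxAI's actual models). Requires: pip install onnxruntime-gpu (or onnxruntime).
import numpy as np
import onnxruntime as ort

# Prefer the GPU provider when available, fall back to CPU.
session = ort.InferenceSession(
    "detector.onnx",  # hypothetical model file
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)

# Build a dummy input matching the model's expected layout (assumed NCHW, 640x640).
input_name = session.get_inputs()[0].name
dummy_image = np.zeros((1, 3, 640, 640), dtype=np.float32)

outputs = session.run(None, {input_name: dummy_image})
print([o.shape for o in outputs])
```

Because inference only needs the onnxruntime wheel and NumPy, the heavy deep learning frameworks would not have to ship in the installed environments, which is where the potential size reduction comes from.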

PetervanLunteren (Owner) commented

Thanks for sharing @arky! Do I understand correctly that onnxruntime-gpu only works for ONNX format models?

arky (Author) commented Mar 3, 2025

@PetervanLunteren Yes, both onnxruntime (CPU) and onnxruntime-gpu are specific to the Microsoft ONNX model format.
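
Since ONNX Runtime only runs ONNX models, models trained in other frameworks would first have to be exported. As a hedged sketch, a PyTorch model could be converted with `torch.onnx.export`; the stand-in model and shapes below are placeholders, not AddaxAI's actual detection models:

```python
# Sketch: exporting a PyTorch model to ONNX so it can run under
# onnxruntime / onnxruntime-gpu. Model and input shape are placeholders.
import torch
import torchvision

model = torchvision.models.resnet18(weights=None).eval()  # stand-in model
dummy_input = torch.zeros(1, 3, 224, 224)

torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",
    input_names=["images"],
    output_names=["output"],
    opset_version=17,
    dynamic_axes={"images": {0: "batch"}},  # allow variable batch size
)
```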
