Nos is currently broken on systems where GPU drivers are pre-installed on the hosts, for example on AKS. The symptom is that the `gpu-agent` Pod does not start because the `/run/nvidia` path is missing on the host.
According to the NVIDIA DRA driver documentation, the `/run/nvidia` folder is provided by the driver container. When drivers are installed on the host instead of via a container, the path is missing and has to be symlinked to the host root manually:
> Ensure your NVIDIA driver installation is rooted at `/run/nvidia/driver`
> - For deployments running a driver container this is a noop. The driver container should already mount the driver installation at `/run/nvidia/driver`.
> - For deployments running with a host-installed driver, the following is sufficient to meet this requirement:
To implement support for host-installed drivers, we can simply mount the host's `/` as `/run/nvidia/driver` inside the `gpu-agent` container.
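
Mounting the host's `/` at `/run/nvidia/driver` inside the `gpu-agent` container could look roughly like the fragment below. This is only a sketch of the idea, not the actual nos manifest: the volume name `host-root`, the surrounding DaemonSet layout, and the `readOnly` setting are assumptions made for illustration and would need to be adapted to the real chart.

```yaml
# Hypothetical fragment of the gpu-agent Pod template (names are illustrative).
spec:
  template:
    spec:
      containers:
        - name: gpu-agent
          volumeMounts:
            # Expose the host root where the driver installation is expected.
            - name: host-root
              mountPath: /run/nvidia/driver
              # Assumption: the agent only needs to read the driver files.
              readOnly: true
      volumes:
        - name: host-root
          hostPath:
            # Host-installed drivers live under the host's root filesystem.
            path: /
            type: Directory
```

With a mount like this, the path exists inside the container even when no driver container is running on the node, so it would likely need to be made optional (for example behind a chart value) for clusters that do use a driver container.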