update inference ami version in sagemaker endpoint config to fix nvml driver issue #104

kshitizgupta21 · 2024-11-18T21:19:23Z

This PR updates the nim_llama3.ipynb Jupyter notebook to set 'InferenceAmiVersion': 'al2-ami-sagemaker-inference-gpu-2' in ProductionVariants within the EndpointConfig. This ensures that a newer driver is used on g5 and p4d/p4de instances instead of the default 470 one, which resolves the pynvml driver issues. It should fix issue #98.
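For reviewers who want to see the shape of the change outside the notebook, here is a minimal sketch of an endpoint config that carries the new field. Only the `InferenceAmiVersion` key and its value come from this PR; the helper names, the config and model names, the variant name, the instance type, and the instance count are hypothetical placeholders, and the notebook's actual cell may be structured differently.

```python
def build_endpoint_config(config_name, model_name, instance_type="ml.g5.12xlarge"):
    """Return keyword arguments for SageMaker's CreateEndpointConfig call.

    All argument values are illustrative placeholders, not the notebook's own.
    """
    return {
        "EndpointConfigName": config_name,
        "ProductionVariants": [
            {
                "VariantName": "AllTraffic",      # hypothetical variant name
                "ModelName": model_name,
                "InstanceType": instance_type,    # assumed g5 instance size
                "InitialInstanceCount": 1,
                # The setting this PR adds: selects the newer GPU inference AMI
                # instead of the default one that ships the 470 driver.
                "InferenceAmiVersion": "al2-ami-sagemaker-inference-gpu-2",
            }
        ],
    }


def create_endpoint_config(config_name, model_name, **kwargs):
    """Send the config to SageMaker. Needs boto3 and valid AWS credentials."""
    import boto3  # imported lazily so the builder above works without AWS access

    sm = boto3.client("sagemaker")
    return sm.create_endpoint_config(
        **build_endpoint_config(config_name, model_name, **kwargs)
    )
```

Keeping the dictionary builder separate from the API call makes it possible to inspect or unit-test the config without touching an AWS account.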

cc: @JamesMaki @abhisheksawarkar

update inference ami version in sagemaker endpoint config to fix nvml…

1788973

… driver issues

JamesMaki approved these changes Nov 19, 2024

JamesMaki merged commit 0861539 into NVIDIA:main Nov 19, 2024

saurabh-nvidia pushed a commit to saurabh-nvidia/nim-deploy that referenced this pull request Jan 11, 2025

update inference ami version in sagemaker endpoint config to fix nvml…

29f33cc

… driver issues (NVIDIA#104)
