Install Nvidia Drivers on azure ml compute instance

Christos Athanasiadis 0 Reputation points
2024-09-26T23:45:37.6333333+00:00

I am new to Azure world and have created a workspace and a machine learning compute instance with an A100 GPU. I have also added a Cuda-base environment to this compute instance. Moreover, I am trying to install PyTorch and Nvidia drivers to run my ML models using GPU support.

However, I am lost on properly installing the Nvidia drivers in my compute instance. I tried to follow these instructions here, but that didn't work for me. When I run the following command:

nvidia-smi  

I am receiving the following error:

NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running

Is there any suggestion on properly installing the Nvidia drivers in my compute instance?

Azure Machine Learning
Azure Machine Learning
An Azure machine learning service for building and deploying models.
2,887 questions
{count} votes

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.