minikube: nvidia-driver-installer addon fails to start (driver fails to install in the container)
The exact command to reproduce the issue: Following the instructions on https://minikube.sigs.k8s.io/docs/tutorials/nvidia_gpu/
Host: Fedora 31 Kernel: 5.5.7-200.fc31.x86_64 Cuda Drivers Host: 10.2 Nvidia Driver Host: Driver Version: 440.64
GPUs:
- Titan V (vfio-pci driver assigned)
- Geforce 1080TI (host GPU nvidia driver)
Minikube start:
minikube start --vm-driver kvm2 --kvm-gpu --cpus=12 --memory=25480
Minikube Addons:
minikube addons enable nvidia-gpu-device-plugin
This will fail initially, just edit the dc/nvidia-gpu-device-plugin and increase the mem to 100mi and it’ll start fine.
The full output of the command that failed:minikube addons enable nvidia-driver-installer
The container fails to start. and once you fetch teh logs, it will return:
Configuring kernel sources... DONE
Running Nvidia installer...
/usr/local/nvidia /
Verifying archive integrity... OK
Uncompressing NVIDIA Accelerated Graphics Driver for Linux-x86_64 390.67.........................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................
ERROR: An error occurred while performing the step: "Building kernel
modules". See /var/log/nvidia-installer.log for details.
ERROR: An error occurred while performing the step: "Checking to see
whether the nvidia-drm kernel module was successfully built". See
/var/log/nvidia-installer.log for details.
ERROR: The nvidia-drm kernel module was not created.
ERROR: The nvidia-drm kernel module failed to build. This kernel module is
required for the proper operation of DRM-KMS. If you do not need to
use DRM-KMS, you can try to install this driver package again with
the '--no-drm' option.
ERROR: Installation has failed. Please see the file
'/usr/local/nvidia/nvidia-installer.log' for details. You may find
suggestions on fixing installation problems in the README available
on the Linux driver download page at www.nvidia.com.
The output of the minikube logs command:
The minikube logs don’t say much but here they are: minikube-logs.txt
Logs from the nvidia-installer itself: nvidia-driver-installer-logs.txt
The operating system version:
About this issue
- Original URL
- State: closed
- Created 4 years ago
- Reactions: 2
- Comments: 24 (2 by maintainers)
I’m setting up minikube with the kvm2 driver and gpu passthrough as per https://minikube.sigs.k8s.io/docs/tutorials/nvidia_gpu/. GPU passthrough is available to the VM (the PCI-E devices are available as hardware to the minikube VM), but I’m failing to install the nvidia drivers inside the VM via
The nvidia-gpu-device-plugin pod is listed as running, whereas the second pod is stuck on Init (paused). I’m guessing this is the same issue @Nick-Harvey encountered.
kubectl get pods -n kube-systemkubectl logs nvidia-gpu-device-plugin-57n2l -n kube-systemSome additional info on the pods:
`kubectl describe po nvidia-gpu-device-plugin-57n2l -n kube-system`
`kubectl describe po nvidia-driver-installer-c5bvd -n kube-system`
Any suggestions on how to debug this further?
Hello, I’ve got similar issue: looks like addon uses very old NVIDIA GPU driver 390.67 which doesn’t support RTX 3090/3080. How can I install the addon with the new NVIDIA driver?