1.
Installing the 410.48 driver bundled with cuda_10.0.130_410.48_linux.run succeeded, but nvidia-smi
shows:
Check the NVIDIA driver version:
cat /proc/driver/nvidia/version
This shows that the driver was installed successfully.
lspci | grep -i nvidia
This shows two NVIDIA cards in the PCI slots.
2.
/usr/bin/nvidia-uninstall
This uninstalls the driver.
Then I tried installing the driver on its own.
It failed with an error:
3.
So I followed: https://linuxcluster.wordpress.com/2018/10/08/nvidia-drivers-issues-unable-to-find-the-kernel-source-tree/
yum install kernel-devel kernel-headers -y
yum info kernel-devel kernel-headers
The problem persisted.
Finding: the installed kernel-devel and kernel-headers versions differ from the version of the running kernel.
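The mismatch found above can be checked directly instead of reading it off the `yum info` output. This is a minimal sketch, assuming a CentOS/RHEL system where `uname -r` has the form VERSION-RELEASE.ARCH; `check_kernel_devel` is a hypothetical helper name, not part of any tool.

```shell
# check_kernel_devel RUNNING INSTALLED
#   RUNNING   : kernel release string, e.g. the output of `uname -r`
#   INSTALLED : one kernel-devel version per line, in the same format
# Returns 0 if one of the installed versions equals the running kernel.
check_kernel_devel() {
    running="$1"
    installed="$2"
    # -F fixed string, -x whole-line match, -q quiet
    if printf '%s\n' "$installed" | grep -Fxq -- "$running"; then
        echo "match: kernel-devel for $running is installed"
        return 0
    else
        echo "mismatch: running kernel is $running, kernel-devel provides: $installed"
        return 1
    fi
}

# Typical use on the affected machine (requires rpm):
# check_kernel_devel "$(uname -r)" \
#     "$(rpm -q --qf '%{VERSION}-%{RELEASE}.%{ARCH}\n' kernel-devel)"
```

If the function reports a mismatch, the NVIDIA installer will most likely fail to find the kernel source tree for the running kernel, which is the symptom described in this step.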
4.
Next, I consulted: https://unix.stackexchange.com/questions/110682/yum-installs-kernel-devel-different-from-my-kernel-version
yum install "kernel-devel-uname-r == $(uname -r)"
yum distro-sync
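Neither worked:
- The former could not find the package.
- The latter downloaded kernel-devel and kernel-headers packages that still did not match the running kernel.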
5.
Install the matching rpm packages offline: https://pkgs.org/download/kernel-headers
https://pkgs.org/download/kernel-devel
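To pick the right files from those download pages, it helps to know the exact file names to look for. The sketch below assumes the usual CentOS/RHEL naming scheme NAME-VERSION-RELEASE.ARCH.rpm; `expected_rpm_names` is a hypothetical helper, and the install command in the comment is one possible way to install the downloaded files, not necessarily the one used here.

```shell
# expected_rpm_names RELEASE
#   RELEASE : kernel release string, e.g. the output of `uname -r`
# Prints the rpm file names that match that kernel, one per line.
expected_rpm_names() {
    release="$1"
    for pkg in kernel-devel kernel-headers; do
        echo "${pkg}-${release}.rpm"
    done
}

# Typical use on the affected machine:
# expected_rpm_names "$(uname -r)"
# After downloading the two files into the current directory:
# yum localinstall -y ./kernel-devel-"$(uname -r)".rpm ./kernel-headers-"$(uname -r)".rpm
```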
6.
Ran the driver installer again:
The error was still there:
- With dkms accepted:
Failed to run
/usr/sbin/dkms build -m nvidia -v 418.74 -k 3.10.0-862.el7.x86_64
: Error! echo
Your kernel headers for kernel 3.10.0-862.el7.x86_64 cannot be found at
/lib/modules/3.10.0-862.el7.x86_64/build or /lib/modules/3.10.0-862.el7.x86_64/source
- Without dkms:
7.
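- With dkms accepted:
Found that the build symlink named in the error above (under /lib/modules/3.10.0-862.el7.x86_64) needs to be fixed: ls shows its path in red, meaning the target is unreachable.
Recreate the symlink from inside that directory:
Reference: https://blog.csdn.net/u012343297/article/details/79141878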
rm ./build
ln -s /usr/src/kernels/3.10.0-862.11.6.el7.1.x86_64/ build
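Before rerunning the installer, the recreated link can be verified. This is a small sketch, assuming the link lives in the module directory named in the dkms error; `check_build_link` is a hypothetical helper name.

```shell
# check_build_link MODDIR
#   MODDIR : a kernel module directory, e.g. /lib/modules/$(uname -r)
# Returns 0 if MODDIR/build exists and its target is reachable.
check_build_link() {
    moddir="$1"
    # [ -e ] follows symlinks, so it is false for a dangling link
    if [ -e "$moddir/build" ]; then
        echo "ok: $moddir/build -> $(readlink -f "$moddir/build")"
        return 0
    else
        echo "broken: $moddir/build -> $(readlink "$moddir/build")"
        return 1
    fi
}

# Typical use on the affected machine:
# check_build_link "/lib/modules/$(uname -r)"
```

A "broken" result corresponds to the red, unreachable path that ls showed before the link was recreated.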
8.
done!!!