I’m having a tough time getting the NVIDIA drivers to recognize my RTX 3060 Ti graphics card in an eGPU enclosure (I’ve tried two - the Razer Core X and an Akitio Node Titan). I’ve also tried Fedora 37, and pretty much every driver version from 470 to 525. On Ubuntu I’m installing with the apt command (sudo apt install nvidia-driver-xxx). I’ve tried both the open and the regular drivers, to no avail.
Looking at the kernel ring buffer output from dmesg, I see several failures:
[ 7.735454] NVRM: GPU 0000:04:00.0: RmInitAdapter failed! (0x26:0x56:1253)
[ 7.735549] NVRM: GPU 0000:04:00.0: rm_init_adapter failed, device minor number 0
[ 7.736179] [drm:nv_drm_load [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000400] Failed to allocate NvKmsKapiDevice
[ 7.736604] [drm:nv_drm_probe_devices [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000400] Failed to register device
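In case it helps anyone compare their own logs, here is a minimal Python sketch for pulling the NVIDIA-related failure lines out of dmesg output. The helper name nvidia_failures and the regular expression are my own assumptions, not part of any NVIDIA tool; it only filters text and does not interpret the error codes.

```python
import re
import subprocess

# Assumed pattern: a line that mentions NVRM or nvidia-drm and, later on
# the same line, some form of "fail" (case-insensitive).
_PATTERN = re.compile(r"(NVRM|nvidia-drm).*fail", re.IGNORECASE)


def nvidia_failures(dmesg_text: str) -> list[str]:
    """Return the lines of dmesg_text that look like NVIDIA driver failures."""
    return [line for line in dmesg_text.splitlines() if _PATTERN.search(line)]


if __name__ == "__main__":
    # Reading the kernel ring buffer may require root on some systems.
    result = subprocess.run(["dmesg"], capture_output=True, text=True, check=False)
    for line in nvidia_failures(result.stdout):
        print(line)
```

Running it with sudo should print only lines like the ones above, which makes it easier to diff the output between driver versions.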
When I run lspci, the eGPU is found:
04:00.0 VGA compatible controller: NVIDIA Corporation GA104 GeForce RTX 3060 Ti
04:00.1 Audio device: NVIDIA Corporation GA104 High Definition Audio Controller (rev a1)
The eGPU is also listed as a Thunderbolt device in the settings.
From Googling the errors, it looks like there’s some issue with the NVIDIA Linux drivers and the BIOS. I’ve tried every version of the drivers I could find and they all give the same errors.
Anyone have any ideas? If you’ve got the eGPU working in Ubuntu, which drivers did you run & how did you install them?
Fortunately, the eGPU seems to work on Windows. I’m mainly using it for CUDA with PyTorch, so unfortunately I’m having to live in Windows for now, but I’d really like to go back to Ubuntu.
Any help much appreciated!