View previous topic :: View next topic |
Author |
Message |
SkunkMyrddyn n00b
Joined: 25 Dec 2024 Posts: 3
|
Posted: Wed Dec 25, 2024 4:04 am Post subject: Nvidia Datacenter Driver |
|
|
I'm adding a Nvidia Tesla A2 card to my server to support Cuda / Tensor flow / other AI and compute node acceleration. (the card does not have video out connections)
I am having a difficult time installing the correct driver for the system. The general "nvidia-drivers" package 1) requires X (this is a headless server), and 2) does not list this card as supported (if I'm reading the documentation correctly).
Does anyone know how to get the correct driver(s) installed so that pytorch can recognize the nvidia compute nodes for acceleration? |
|
Back to top |
|
|
tiffany n00b
Joined: 04 May 2008 Posts: 11
|
Posted: Wed Dec 25, 2024 9:57 am Post subject: |
|
|
NVidia's site has a separate section for datacenter drivers. Have you seen them?
I see that they support RHEL, Debian and others. |
|
Back to top |
|
|
SkunkMyrddyn n00b
Joined: 25 Dec 2024 Posts: 3
|
Posted: Wed Dec 25, 2024 4:38 pm Post subject: |
|
|
I checked those out and wasn't sure how to convince gentoo to handle one of the other packaging formats. So I did grab the tarballs they have, which have a nvidia-installer binary; but I can't get that to run either.
I found that it has a --no-x-check that will bypass seeing if X (of some kind) is installed or not.
However, the installer errors out saying it cannot figure out my initramfs. Which makes sense as I am not using an initramfs at all on this system. Nor do I see an option to inform that installer to bypass it.
I feel like I'm missing something basic. |
|
Back to top |
|
|
Banana Moderator
Joined: 21 May 2004 Posts: 1801 Location: Germany
|
|
Back to top |
|
|
SkunkMyrddyn n00b
Joined: 25 Dec 2024 Posts: 3
|
Posted: Thu Dec 26, 2024 10:11 am Post subject: |
|
|
The nvidia-cuda-toolkit doesn't install a driver, so python torch does not find any cuda devices.
With -X set as a USE flag blocks the x11-drivers/nvidia-drivers from installing. |
|
Back to top |
|
|
Hu Administrator
Joined: 06 Mar 2007 Posts: 22874
|
Posted: Thu Dec 26, 2024 12:02 pm Post subject: |
|
|
SkunkMyrddyn wrote: | With -X set as a USE flag blocks the x11-drivers/nvidia-drivers from installing. | Please show the output that led to this statement. I do not see that result here: Code: | # USE=-X emerge -pv nvidia-drivers
These are the packages that would be merged, in order:
Calculating dependencies... done!
Dependency resolution took 2.59 s (backtrack: 0/20).
...
[ebuild N ] x11-drivers/nvidia-drivers-550.135:0/550::gentoo USE="modules strip tools -X -dist-kernel -kernel-open -modules-compress -modules-sign -persistenced -powerd -static-libs -wayland" ABI_X86="(64) -32" 314787 KiB
|
|
|
Back to top |
|
|
|