-
Notifications
You must be signed in to change notification settings - Fork 4
Open
Description
Containers running on nodes that support NVIDIA drivers should use the nvidia-container-toolkit to provide proper driver integration. This feature should include
- Admin documentation for
- Installing
nvidia-driverfrom official Debian repos - Installing
nvidia-container-toolkitfrom NVIDIA sources (link to NVIDIA docs) - Configuring the /usr/share/lxc/hooks/nvida hook script for API use by symlinking to /var/lib/vz/snippets (idk is this the best way?)
- Installing
- Container creator updates to
- Identify NVIDIA nodes (boolean in Nodes model? autodetected based on hook script presence?)
- Add the
NVIDIA_VISIBLE_DEVICES=allandNVIDIA_DRIVER_CAPABILITIES=utility computeenvironment variables + the hook script to containers created on NVIDIA nodes. - Boolean in Containers model "GPU Required" to enforce being created on a Node with a GPU? (Unnessecary if GPU Nodes are in their own sites, but nessecary if we have mixed sites, would require the boolean in the Nodes model rather than autodetection)
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels