k8s-gpu-no-nvidia-devices

Community

Restore GPU visibility in Kubernetes pods.

AuthorViktorBarzin
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Kubernetes pods scheduled with GPUs sometimes see no NVIDIA devices inside the container, causing CUDA-enabled workloads to fall back to CPU despite GPU resource requests.

Core Features & Use Cases

  • Diagnose GPU injection issues in Kubernetes clusters and verify device visibility inside pods.
  • Validate NVIDIA device plugin status and perform safe restarts or resource reallocation to recover GPU access.
  • Use cases include resolving "CUDA not supported" or "no devices /dev/nvidia*" errors in GPU-enabled workloads.

Quick Start

Restore NVIDIA device visibility in a failing GPU pod by following the remediation steps.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: k8s-gpu-no-nvidia-devices
Download link: https://github.com/ViktorBarzin/infra/archive/main.zip#k8s-gpu-no-nvidia-devices

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.