devops:gpu-analysis

Community

Analyze GPU cluster usage and health.

AuthorChanghwanK
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides a comprehensive analysis of GPU cluster usage and health, helping you understand resource allocation, identify bottlenecks, and determine model deployment feasibility.

Core Features & Use Cases

  • Cluster Overview: Get a real-time snapshot of GPU node status, VRAM utilization, power consumption, and temperature.
  • Resource Allocation: Analyze how workloads are mapped to GPUs and identify underutilized or overutilized resources.
  • Deployment Feasibility: Assess if your models can be deployed based on VRAM availability and requirements.
  • Troubleshooting: Detect nodes with high load or abnormal conditions.

Quick Start

Analyze the current GPU cluster status and VRAM usage for the k8s-idc context.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: devops:gpu-analysis
Download link: https://github.com/ChanghwanK/dotfiles/archive/main.zip#devops-gpu-analysis

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.