devops:gpu-analysis
Community
Analyze GPU cluster usage and health.
Author: ChanghwanK
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill analyzes GPU cluster usage and health so you can see how resources are allocated, identify bottlenecks, and determine whether a model can be deployed on the available hardware.
Core Features & Use Cases
- Cluster Overview: Get a real-time snapshot of GPU node status, VRAM utilization, power consumption, and temperature.
- Resource Allocation: Analyze how workloads are mapped to GPUs and identify underutilized or overutilized GPUs.
- Deployment Feasibility: Assess whether your models can be deployed by comparing their VRAM requirements against available VRAM.
- Troubleshooting: Detect nodes with high load or abnormal conditions.
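The checks listed above can be illustrated with a minimal Python sketch. This is not the Skill's own script (its contents are not shown here). It assumes GPU metrics arrive as CSV in the column order produced by `nvidia-smi --query-gpu=index,memory.used,memory.total,temperature.gpu,power.draw --format=csv,noheader,nounits`. The names `GpuStat`, `parse_gpu_csv`, `hot_or_full`, and `gpus_that_fit`, the thresholds (85 °C, 90% VRAM), and the sample data are all hypothetical.

```python
import csv
import io
from dataclasses import dataclass


@dataclass
class GpuStat:
    """One GPU's metrics (hypothetical container, not part of the Skill)."""
    index: int
    mem_used_mib: float
    mem_total_mib: float
    temp_c: float
    power_w: float

    @property
    def mem_free_mib(self) -> float:
        return self.mem_total_mib - self.mem_used_mib

    @property
    def vram_utilization(self) -> float:
        return self.mem_used_mib / self.mem_total_mib


def parse_gpu_csv(text):
    """Parse rows of: index, memory.used, memory.total, temperature.gpu, power.draw."""
    stats = []
    for row in csv.reader(io.StringIO(text), skipinitialspace=True):
        if not row:  # skip blank lines
            continue
        idx, used, total, temp, power = (field.strip() for field in row)
        stats.append(
            GpuStat(int(idx), float(used), float(total), float(temp), float(power))
        )
    return stats


def hot_or_full(stats, max_temp_c=85.0, max_vram_util=0.90):
    """Troubleshooting: indices of GPUs at or above either assumed threshold."""
    return [
        s.index
        for s in stats
        if s.temp_c >= max_temp_c or s.vram_utilization >= max_vram_util
    ]


def gpus_that_fit(stats, required_mib):
    """Deployment feasibility: indices of GPUs with enough free VRAM."""
    return [s.index for s in stats if s.mem_free_mib >= required_mib]


# Made-up sample data for illustration only (MiB, degrees C, watts).
SAMPLE = """\
0, 70000, 81920, 88, 310.5
1, 1024, 81920, 41, 65.2
2, 78000, 81920, 60, 280.0
"""

if __name__ == "__main__":
    stats = parse_gpu_csv(SAMPLE)
    print("needs attention:", hot_or_full(stats))
    print("can host a 40000 MiB model:", gpus_that_fit(stats, 40000))
```

On a real node, the output of the `nvidia-smi` command above could be passed to `parse_gpu_csv` in place of `SAMPLE`. The thresholds should be tuned to the hardware and workload in question.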
Quick Start
Analyze the current GPU cluster status and VRAM usage for the k8s-idc context.
Dependency Matrix
Required Modules
None required
Components
scripts, references
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: devops:gpu-analysis
Download link: https://github.com/ChanghwanK/dotfiles/archive/main.zip#devops-gpu-analysis
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.