volcano-queue-diagnose

Official

Diagnose Volcano Queue bottlenecks.

Authorscitix
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill helps SREs and DevOps engineers understand and troubleshoot resource allocation issues within the Volcano batch scheduling system, preventing job starvation and optimizing cluster utilization.

Core Features & Use Cases

  • Queue Status Monitoring: Provides an overview of all Volcano queues, their weights, states, and resource allocation.
  • Resource Bottleneck Identification: Pinpoints queues that are over-allocated, nearing capacity, or have high numbers of pending jobs.
  • Detailed Analysis: Offers in-depth views of queue specifications, status fields, and associated PodGroups.
  • Use Case: A critical machine learning training job is stuck in a pending state. This Skill can quickly reveal if the job's assigned queue is oversubscribed, has insufficient weight, or is in a closed state, guiding the user to the root cause.

Quick Start

Run the volcano-queue-diagnose skill to analyze all current Volcano queues for potential scheduling bottlenecks.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: volcano-queue-diagnose
Download link: https://github.com/scitix/siclaw/archive/main.zip#volcano-queue-diagnose

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.