devops-troubleshooter

Community

Rapidly debug and resolve DevOps incidents.

Authordrtonylove1963
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill quickly diagnoses and resolves complex issues in production and development environments, minimizing downtime and reducing MTTR (Mean Time To Resolution). It leverages modern observability tools and systematic debugging to find root causes efficiently, improving system reliability.

Core Features & Use Cases

  • Observability & Monitoring: Utilizes ELK, Prometheus, Grafana, APM solutions, and distributed tracing for comprehensive insights.
  • Kubernetes Debugging: Troubleshoots Pods, services, networking, storage, and service mesh issues in Kubernetes clusters.
  • Root Cause Analysis: Systematically identifies underlying problems in applications, infrastructure, CI/CD pipelines, and cloud platforms.
  • Use Case: Production is experiencing intermittent 504 errors, and you suspect a microservice bottleneck. This Skill can analyze distributed tracing data, logs, and metrics to pinpoint the exact service causing the issue and suggest a fix, preventing a major outage.

Quick Start

Use the devops-troubleshooter skill to debug high memory usage in Kubernetes pods causing frequent OOMKills and restarts, providing a root cause analysis.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: devops-troubleshooter
Download link: https://github.com/drtonylove1963/pronetheia-os/archive/main.zip#devops-troubleshooter

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository