investigate-alert

Community

Triage firing Prometheus alerts instantly.

Authordmzoneill
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill rapidly triages firing Prometheus alerts by gathering essential context like pod health, events, and logs, enabling quick assessment and escalation.

Core Features & Use Cases

  • Alert Triage: Automatically fetches firing alerts from Prometheus.
  • System Health Check: Assesses Kubernetes pod health, resource usage, and recent events.
  • Log Analysis: Searches Kibana for relevant errors and exceptions.
  • Pattern Matching: Compares issues against known patterns for faster diagnosis.
  • Escalation: Automatically escalates critical alerts to debug_prod if necessary.
  • Use Case: When a critical alert fires, this Skill provides a comprehensive overview of the system's status, helping to pinpoint the root cause without manual intervention.

Quick Start

Investigate the firing Prometheus alerts for the 'stage' environment and 'main' namespace.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: investigate-alert
Download link: https://github.com/dmzoneill/skills/archive/main.zip#investigate-alert

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.