investigate-alert
Community
Triage firing Prometheus alerts instantly.
Author: dmzoneill
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill rapidly triages firing Prometheus alerts by gathering essential context like pod health, events, and logs, enabling quick assessment and escalation.
Core Features & Use Cases
- Alert Triage: Automatically fetches firing alerts from Prometheus.
- System Health Check: Assesses Kubernetes pod health, resource usage, and recent events.
- Log Analysis: Searches Kibana for relevant errors and exceptions.
- Pattern Matching: Compares issues against known patterns for faster diagnosis.
- Escalation: Automatically escalates critical alerts to debug_prod if necessary.
- Use Case: When a critical alert fires, this Skill provides a comprehensive overview of the system's status, helping to pinpoint the root cause without manual intervention.
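The alert-fetching step can be sketched as follows. This is a minimal illustration, not the Skill's actual script: it assumes Prometheus is reachable at a base URL you supply, uses the standard `/api/v1/alerts` HTTP endpoint, and assumes alerts carry `namespace` and `severity` labels (common in Kubernetes setups, but not guaranteed). The function names `fetch_alerts`, `firing_alerts`, and `summarize` are hypothetical.

```python
import json
import urllib.request


def fetch_alerts(base_url, timeout=10.0):
    """Return the raw alert list from Prometheus's /api/v1/alerts endpoint."""
    url = f"{base_url.rstrip('/')}/api/v1/alerts"
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        payload = json.load(resp)
    if payload.get("status") != "success":
        raise RuntimeError(f"Prometheus returned status {payload.get('status')!r}")
    return payload["data"]["alerts"]


def firing_alerts(alerts, namespace=None):
    """Keep alerts in the 'firing' state, optionally limited to one namespace.

    Assumption: alerts expose the Kubernetes namespace as a 'namespace' label.
    """
    selected = []
    for alert in alerts:
        if alert.get("state") != "firing":
            continue
        labels = alert.get("labels", {})
        if namespace is not None and labels.get("namespace") != namespace:
            continue
        selected.append(alert)
    return selected


def summarize(alert):
    """Render one alert as a short triage line."""
    labels = alert.get("labels", {})
    severity = labels.get("severity", "unknown")
    name = labels.get("alertname", "?")
    namespace = labels.get("namespace", "-")
    return f"[{severity}] {name} (namespace={namespace})"
```

A caller would typically run `firing_alerts(fetch_alerts(base_url), namespace="main")` and print `summarize(a)` for each result before moving on to pod, event, and log checks.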
Quick Start
Investigate the firing Prometheus alerts for the 'stage' environment and 'main' namespace.
Dependency Matrix
Required Modules
None required
Components
scripts, references
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: investigate-alert
Download link: https://github.com/dmzoneill/skills/archive/main.zip#investigate-alert
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
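For a manual install, the extraction step might look like the sketch below. It assumes the downloaded archive contains a directory named investigate-alert somewhere in its tree; the archive's actual layout is not documented here, so treat that as an assumption. The helper name `extract_skill` is hypothetical.

```python
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(zip_path, skill_name, dest_root):
    """Copy files under a directory named `skill_name` to dest_root/skill_name.

    Assumption: the skill lives in a directory called `skill_name` at some
    depth inside the archive. Returns the list of paths written.
    """
    target = Path(dest_root) / skill_name
    written = []
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            # Only accept entries where skill_name is a directory component.
            if skill_name not in parts[:-1]:
                continue
            rel = parts[parts.index(skill_name) + 1:]
            if ".." in rel:
                continue  # skip entries that try to escape the target directory
            out = target.joinpath(*rel)
            out.parent.mkdir(parents=True, exist_ok=True)
            out.write_bytes(archive.read(info))
            written.append(out)
    return written
```

Calling `extract_skill("main.zip", "investigate-alert", ".claude/skills")` would place the files under `.claude/skills/investigate-alert/`; an empty return value means no matching directory was found in the archive.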