incident-responder
Community
Rapidly resolve critical production incidents.
Software Engineering
#DevOps, #post-mortem, #observability, #reliability, #incident response, #SRE, #production outage
Author: drtonylove1963
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides an immediate, structured response to critical production incidents, with the aim of minimizing downtime and business impact. It supports efficient resolution and clear communication during the incident, and continued learning afterwards through blameless post-mortems, which in turn improves overall system reliability.
Core Features & Use Cases
- Rapid Incident Command: Establishes clear roles (Incident Commander, Communication Lead, Technical Lead) and immediate stabilization actions.
- Observability-Driven Investigation: Leverages distributed tracing, metrics correlation, log aggregation, and APM analysis for root cause identification.
- Structured Communication: Manages internal and external updates with appropriate technical depth and frequency, including status page updates.
- Use Case: A critical production service is down, impacting users globally. This Skill can immediately guide the incident response, establish command, suggest quick stabilization actions, and manage communication, ensuring a swift and coordinated recovery.
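The command structure and update cadence described above can be sketched as a small data model. This is an illustrative sketch only, not the Skill's implementation: the three roles come from the feature list, while the `Incident` class, its methods, and the update intervals in `UPDATE_INTERVAL` are hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta
from enum import Enum


class Role(Enum):
    # The three roles named in the feature list.
    INCIDENT_COMMANDER = "Incident Commander"
    COMMUNICATION_LEAD = "Communication Lead"
    TECHNICAL_LEAD = "Technical Lead"


# Hypothetical update cadence per severity; the Skill does not specify these values.
UPDATE_INTERVAL = {
    "P0": timedelta(minutes=15),
    "P1": timedelta(minutes=30),
}


@dataclass
class Incident:
    title: str
    severity: str
    opened_at: datetime
    roles: dict = field(default_factory=dict)
    timeline: list = field(default_factory=list)

    def log(self, when: datetime, note: str) -> None:
        # Keep a timestamped record that can feed a blameless post-mortem.
        self.timeline.append((when, note))

    def assign(self, role: Role, person: str, when: datetime) -> None:
        self.roles[role] = person
        self.log(when, f"{role.value} assigned: {person}")

    def unfilled_roles(self) -> list:
        # Roles still missing, in the order they are declared on Role.
        return [role for role in Role if role not in self.roles]

    def next_update_due(self, last_update: datetime) -> datetime:
        # When the next status update should go out for this severity.
        return last_update + UPDATE_INTERVAL[self.severity]


opened = datetime(2024, 1, 1, 12, 0)
incident = Incident("Checkout API unavailable", "P0", opened)
incident.assign(Role.INCIDENT_COMMANDER, "alice", opened)
missing = incident.unfilled_roles()
```

Passing timestamps in explicitly, rather than reading the clock inside the methods, keeps the timeline reproducible when it is replayed during a post-mortem.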
Quick Start
Use the incident-responder skill to guide the response for a P0 critical production outage, outlining immediate actions and communication strategy.
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install the Skill automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: incident-responder
Download link: https://github.com/drtonylove1963/pronetheia-os/archive/main.zip#incident-responder
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
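For readers who prefer to install by hand, the download, extract, and install steps could be scripted roughly as follows. This is a minimal sketch under stated assumptions: the helper names `download_archive` and `install_skill` are hypothetical, and the location of the `incident-responder` folder inside the archive is not documented here, so the code simply searches the extracted tree for a folder with that name.

```python
import shutil
import tempfile
import urllib.parse
import urllib.request
import zipfile
from pathlib import Path


def download_archive(url: str, dest: Path) -> Path:
    """Download the archive to dest, dropping any '#fragment' from the URL first."""
    clean_url = urllib.parse.urldefrag(url).url
    urllib.request.urlretrieve(clean_url, dest)
    return dest


def install_skill(zip_path, skill_name: str, skills_dir) -> Path:
    """Copy the first folder named skill_name found in the archive into skills_dir.

    Assumption: the archive contains a directory named exactly skill_name
    somewhere in its tree; its real location is not confirmed by the listing.
    """
    skills_dir = Path(skills_dir)
    target = skills_dir / skill_name
    if target.exists():
        raise FileExistsError(f"{target} already exists; remove it before reinstalling")

    with tempfile.TemporaryDirectory() as tmp:
        with zipfile.ZipFile(zip_path) as archive:
            archive.extractall(tmp)
        # Search the extracted tree for directories with the skill's name.
        matches = sorted(p for p in Path(tmp).rglob(skill_name) if p.is_dir())
        if not matches:
            raise FileNotFoundError(f"no folder named {skill_name!r} in {zip_path}")
        skills_dir.mkdir(parents=True, exist_ok=True)
        shutil.copytree(matches[0], target)
    return target
```

A typical call would first fetch the archive with `download_archive(url, Path("main.zip"))` using the download link above, then run `install_skill("main.zip", "incident-responder", ".claude/skills")`. Refusing to overwrite an existing folder is a deliberate choice so that a local copy of the Skill is never replaced silently.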