managing-incidents

Community

Master incident response with SRE best practices.

Authorancoleman
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides a comprehensive framework for managing incidents, from initial detection and response to blameless post-mortems, ensuring efficient resolution and continuous improvement.

Core Features & Use Cases

  • Incident Lifecycle Management: Covers detection, triage, declaration, investigation, mitigation, resolution, and closure.
  • Role Definition: Clearly outlines responsibilities for Incident Commander (IC), Communications Lead, SMEs, and Scribe.
  • Blameless Post-Mortems: Guides teams through conducting effective post-mortems to learn from failures.
  • Use Case: When a critical service outage occurs (SEV0/SEV1), use this Skill to guide the response team through the established incident management workflow, ensuring clear communication and timely resolution.

Quick Start

Use the managing-incidents skill to classify the severity of an ongoing incident.

Dependency Matrix

Required Modules

flaskslack_sdkgoogle-authgoogle-auth-oauthlibgoogle-auth-httplib2google-api-python-clientrequests

Components

scriptsreferencesexamples

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: managing-incidents
Download link: https://github.com/ancoleman/ai-design-components/archive/main.zip#managing-incidents

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.