Failure Mode Analysis
CommunityIdentify and mitigate system failures.
Software Engineering#system design#resilience#failure analysis#risk assessment#availability#mitigation
Authordtsong
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill helps proactively identify potential failure scenarios in systems and design effective mitigation strategies to ensure resilience and availability.
Core Features & Use Cases
- Systematic Failure Discovery: Enumerate potential failure modes for each component of a system.
- Cascade Risk Assessment: Map how failures in one component can impact others.
- Mitigation Design: Develop strategies for graceful degradation, retries, and fallbacks.
- Monitoring & Rollback Planning: Define signals for detection and plan rollback procedures.
- Use Case: Before deploying a new microservice, use this Skill to brainstorm all the ways it could fail (e.g., database connection issues, API timeouts, resource exhaustion) and define how the system will behave and recover in each scenario.
Quick Start
Analyze potential failure modes for the new user authentication service.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: Failure Mode Analysis Download link: https://github.com/dtsong/claude-code-windows-setup/archive/main.zip#failure-mode-analysis Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.