sre-expert
Community
Master SRE: SLOs, incidents, and ops excellence.
Software Engineering #monitoring #observability #reliability #chaos engineering #sre #incident management #slo
Author: JonathanMitchell1234
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides expert guidance on Site Reliability Engineering (SRE) principles, enabling teams to improve system reliability, manage incidents effectively, and achieve operational excellence.
Core Features & Use Cases
- SLO/SLI Management: Define, track, and calculate compliance for Service Level Objectives and Indicators.
- Incident Management: Create, update, and report on incidents, including MTTR calculation.
- Monitoring & Alerting: Set up monitoring metrics and define alerting rules.
- Chaos Engineering: Design and run experiments to test system resilience.
- Use Case: A team can use this Skill to define SLOs for their API, track performance against those SLOs, and automatically generate incident reports when service levels are breached.
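The SLO compliance and MTTR calculations mentioned above can be sketched in a few lines of Python. This is a minimal illustration, not the Skill's own scripts: the function names (`sli`, `meets_slo`, `error_budget_remaining`, `mttr`) and the event-count inputs are hypothetical, and it assumes a simple ratio-based SLI (good events divided by total events).

```python
from datetime import datetime, timedelta


def sli(good_events: int, total_events: int) -> float:
    """SLI as the fraction of good events; an empty window counts as fully good."""
    if total_events == 0:
        return 1.0
    return good_events / total_events


def meets_slo(target: float, good_events: int, total_events: int) -> bool:
    """True when the measured SLI is at or above the SLO target (e.g. 0.999)."""
    return sli(good_events, total_events) >= target


def error_budget_remaining(target: float, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent; negative once the SLO is breached."""
    if not 0.0 < target < 1.0:
        raise ValueError("target must be strictly between 0 and 1")
    if total_events == 0:
        return 1.0
    allowed_bad = (1.0 - target) * total_events
    actual_bad = total_events - good_events
    return 1.0 - actual_bad / allowed_bad


def mttr(incidents: list[tuple[datetime, datetime]]) -> timedelta:
    """Mean time to recovery: average of (resolved - opened) over resolved incidents."""
    if not incidents:
        return timedelta(0)
    total = sum((resolved - opened for opened, resolved in incidents), timedelta(0))
    return total / len(incidents)


if __name__ == "__main__":
    # Hypothetical numbers: 1,000 API requests, 995 of them successful, 99% SLO.
    print(meets_slo(0.99, 995, 1000))
    print(error_budget_remaining(0.99, 995, 1000))

    # Two hypothetical incidents lasting 30 and 90 minutes.
    start = datetime(2024, 1, 1, 12, 0)
    print(mttr([
        (start, start + timedelta(minutes=30)),
        (start, start + timedelta(minutes=90)),
    ]))
```

A ratio-based SLI is only one option; latency-threshold or time-window SLIs would need different inputs, but the compliance and error-budget arithmetic stays the same.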
Quick Start
Use the sre-expert skill to define standard SLOs for a web service.
Dependency Matrix
Required Modules
None required
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: sre-expert
Download link: https://github.com/JonathanMitchell1234/Stock-Swing-Trading-Bot/archive/main.zip#sre-expert
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.