sre-engineer
CommunityBuild and maintain reliable systems.
Author404kidwiz
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill provides expert guidance and tools for ensuring the reliability, availability, and performance of complex software systems, minimizing downtime and improving user experience.
Core Features & Use Cases
- SLO Definition & Monitoring: Define Service Level Indicators (SLIs) and Objectives (SLOs), set up error budgets, and implement monitoring and alerting.
- Incident Management: Provides frameworks and steps for managing incidents effectively, from declaration to post-mortem.
- Chaos Engineering: Designs and executes experiments to proactively identify and fix system weaknesses before they cause real outages.
- Use Case: You need to define SLOs for a new critical API, including setting up Prometheus alerts and a Grafana dashboard to track its availability and latency, and establish an error budget policy.
Quick Start
Use the sre-engineer skill to define SLOs for the new user authentication service.
Dependency Matrix
Required Modules
None requiredComponents
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: sre-engineer Download link: https://github.com/404kidwiz/claude-supercode-skills/archive/main.zip#sre-engineer Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.