sre-engineer

Community

Build and maintain reliable systems.

Author404kidwiz
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides expert guidance and tools for ensuring the reliability, availability, and performance of complex software systems, minimizing downtime and improving user experience.

Core Features & Use Cases

  • SLO Definition & Monitoring: Define Service Level Indicators (SLIs) and Objectives (SLOs), set up error budgets, and implement monitoring and alerting.
  • Incident Management: Provides frameworks and steps for managing incidents effectively, from declaration to post-mortem.
  • Chaos Engineering: Designs and executes experiments to proactively identify and fix system weaknesses before they cause real outages.
  • Use Case: You need to define SLOs for a new critical API, including setting up Prometheus alerts and a Grafana dashboard to track its availability and latency, and establish an error budget policy.

Quick Start

Use the sre-engineer skill to define SLOs for the new user authentication service.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: sre-engineer
Download link: https://github.com/404kidwiz/claude-supercode-skills/archive/main.zip#sre-engineer

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.