triage-failure

Official

Diagnose and fix benchmark failures.

Authorsourcegraph
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill helps developers quickly identify the root cause of failed benchmark tasks, understand the failure type, and receive actionable suggestions for remediation.

Core Features & Use Cases

  • Automated Failure Analysis: Parses logs and task outputs to diagnose errors.
  • Root Cause Identification: Categorizes failures into types like infrastructure, agent bugs, or task difficulty.
  • Suggested Fixes: Provides specific commands or code changes to resolve the issue.
  • Use Case: When a benchmark run fails, use this Skill to get an immediate diagnosis and a clear path to fixing the problem, rather than manually sifting through logs.

Quick Start

Use the triage-failure skill to investigate the most recent benchmark failure.

Dependency Matrix

Required Modules

jsonsysosre

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: triage-failure
Download link: https://github.com/sourcegraph/CodeScaleBench/archive/main.zip#triage-failure

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.