ecc-eval-harness

Community

Eval-driven testing framework for Claude Code.

Authorkarimatayuta
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Formal evaluation framework that enables eval-driven development for Claude Code sessions, providing structured testing, progress tracking, and regression prevention.

Core Features & Use Cases

  • Capability Evals to validate new features
  • Regression Evals to prevent breakages
  • Diverse graders (Code-based, Model-based, Human) for flexible assessment
  • Metrics like pass@k and pass^k to measure reliability
  • End-to-end eval workflow from define to report

Quick Start

Run the eval workflow to define, implement tests, run checks with /eval, and generate a report to ship the feature.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: ecc-eval-harness
Download link: https://github.com/karimatayuta/graph-vector-rag/archive/main.zip#ecc-eval-harness

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.