ecc-eval-harness
CommunityEval-driven testing framework for Claude Code.
Authorkarimatayuta
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Formal evaluation framework that enables eval-driven development for Claude Code sessions, providing structured testing, progress tracking, and regression prevention.
Core Features & Use Cases
- Capability Evals to validate new features
- Regression Evals to prevent breakages
- Diverse graders (Code-based, Model-based, Human) for flexible assessment
- Metrics like pass@k and pass^k to measure reliability
- End-to-end eval workflow from define to report
Quick Start
Run the eval workflow to define, implement tests, run checks with /eval, and generate a report to ship the feature.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: ecc-eval-harness Download link: https://github.com/karimatayuta/graph-vector-rag/archive/main.zip#ecc-eval-harness Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.