Eval Running Skill
OfficialBenchmark Loa skill quality with eval suites.
Author0xHoneyJar
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Run evaluation suites to detect regressions and benchmark skill quality across the Loa framework.
Core Features & Use Cases
- Supports framework correctness, regression, and skill-quality eval suites.
- Enables baseline updates and result comparisons for CI and local validation.
- Produces structured outputs (JSONL) for downstream analytics and auditing.
Quick Start
Run the evaluation harness to execute the framework correctness suite.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: Eval Running Skill Download link: https://github.com/0xHoneyJar/loa-freeside/archive/main.zip#eval-running-skill Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.