Eval Running Skill

Official

Benchmark Loa skill quality with eval suites.

Author0xHoneyJar
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Run evaluation suites to detect regressions and benchmark skill quality across the Loa framework.

Core Features & Use Cases

  • Supports framework correctness, regression, and skill-quality eval suites.
  • Enables baseline updates and result comparisons for CI and local validation.
  • Produces structured outputs (JSONL) for downstream analytics and auditing.

Quick Start

Run the evaluation harness to execute the framework correctness suite.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: Eval Running Skill
Download link: https://github.com/0xHoneyJar/loa-freeside/archive/main.zip#eval-running-skill

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.