Name: flow-skill-write-agent-benchmarks
Availability: InStock
Author: korchasa

System Documentation

What problem does it solve?

Benchmarks that objectively evaluate AI agents in controlled, verifiable environments, enabling reproducible assessment and auditable results.

Core Features & Use Cases

Standardized evaluation workflows for AI agents across CLI/IDE, API, and chat interfaces.
Isolated, deterministic environments with artifact-focused evidence collection and traceability.
End-to-end benchmarking scenarios with a universal result schema for cross-platform comparison and reporting.

Quick Start

Run the benchmark workflow to initialize an environment, execute a scenario, and generate a report.

Please help me install this Skill: Name: flow-skill-write-agent-benchmarks Download link: https://github.com/korchasa/flow/archive/main.zip#flow-skill-write-agent-benchmarks Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

flow-skill-write-agent-benchmarks

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper