flow-fix-by-benchmarks
Community
Diagnose skill failures with benchmarks.
Software Engineering #automation #debugging #verification #benchmark #skill #root-cause-analysis #assistflow
Author: korchasa
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides a structured approach to debugging and improving AssistFlow skills by running benchmarks to identify root causes of failures and propose data-driven fixes.
Core Features & Use Cases
- Identify the relevant benchmark scenario for the skill under test. Scenarios are located in benchmarks/<skill>/scenarios/.
- Run the scenario with the benchmark-runner subagent, which executes the tests, collects outputs, and reports Pass/Fail with evidence.
- Determine root causes, draft proposed fixes with supporting reasoning, and prepare a verification plan that re-runs the benchmarks to confirm stability before any change is applied.
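The workflow above can be sketched in Python. This is a minimal illustration, not the Skill's actual implementation: only the benchmarks/<skill>/scenarios/ layout comes from this page. The `RunResult` record, the helper names, and the "flaky" label for scenarios that pass on some runs and fail on others are all assumptions made for the example, and the results are assumed to have already been collected from the benchmark-runner subagent.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class RunResult:
    """Hypothetical record of one benchmark run (not the Skill's real format)."""
    scenario: str
    passed: bool
    evidence: str


def list_scenarios(repo_root, skill):
    """Return the sorted entry names under benchmarks/<skill>/scenarios/."""
    scenario_dir = Path(repo_root) / "benchmarks" / skill / "scenarios"
    if not scenario_dir.is_dir():
        return []
    return sorted(p.name for p in scenario_dir.iterdir())


def summarize(results):
    """Classify each scenario as 'pass', 'fail', or 'flaky' across repeated runs."""
    outcomes = {}
    for result in results:
        outcomes.setdefault(result.scenario, []).append(result.passed)

    summary = {}
    for scenario, runs in outcomes.items():
        if all(runs):
            summary[scenario] = "pass"      # stable: every run passed
        elif not any(runs):
            summary[scenario] = "fail"      # stable failure: good root-cause candidate
        else:
            summary[scenario] = "flaky"     # mixed outcomes: re-run before fixing
    return summary


if __name__ == "__main__":
    # Illustrative data only; scenario names are made up.
    runs = [
        RunResult("basic", True, "output matched"),
        RunResult("basic", True, "output matched"),
        RunResult("edge-case", False, "missing section"),
        RunResult("edge-case", True, "output matched"),
    ]
    print(summarize(runs))
```

Re-running each scenario several times before and after a fix, and comparing the two summaries, is one way to carry out the verification plan described in the last bullet.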
Quick Start
Run a benchmark for a target skill and review the results before proposing fixes.
Dependency Matrix
Required Modules
None required
Components
Standard package
Claude Code Installation
Recommended: let Claude install the Skill automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: flow-fix-by-benchmarks
Download link: https://github.com/korchasa/flow/archive/main.zip#flow-fix-by-benchmarks
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
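For readers who prefer to do the extraction step by hand, the following is a rough sketch of what it could look like once the .zip has been downloaded. It rests on an assumption that this page does not confirm: that the archive contains a folder named after the Skill somewhere in its tree. The function name `extract_skill` and its parameters are hypothetical.

```python
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(zip_path, skill_name, skills_dir):
    """Copy files found under any folder named `skill_name` in the archive
    into <skills_dir>/<skill_name>/ and return the written paths.

    Assumption: the archive holds a directory called `skill_name`.
    """
    target_root = Path(skills_dir) / skill_name
    written = []
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            # Only keep files that sit below a directory named skill_name.
            if skill_name not in parts[:-1]:
                continue
            relative = parts[parts.index(skill_name) + 1:]
            # Skip entries that try to climb out of the target folder.
            if ".." in relative:
                continue
            destination = target_root.joinpath(*relative)
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(archive.read(info))
            written.append(destination)
    return written
```

A call such as `extract_skill("main.zip", "flow-fix-by-benchmarks", ".claude/skills")` would then place the files in .claude/skills/flow-fix-by-benchmarks/. If it returns an empty list, the assumed folder name does not match the archive layout.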