train-visual-qa
CommunityIteratively tune visual QA prompts
Authorthomasttvo
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Ensures visual-qa analysis prompts reliably detect layout and spacing defects in mobile UI screenshots by automating evaluation against labeled test cases and preventing regressions caused by ad-hoc prompt edits.
Core Features & Use Cases
- Automated Test Harness: Loads captured test cases with expected signals and known false positives, runs the visual-qa prompt against each screenshot, and collects outputs for scoring.
- Deterministic Scoring: Compares outputs to ground truth to mark FOUND/MISSED signals and AVOIDED/FLAGGED false positives, applying an 80%+ pass threshold and zero flagged known false positives as pass criteria.
- Iterative Prompt Tuning: Guides structural prompt changes, re-runs the full suite to detect regressions, and repeats until results stabilize or iteration limits are reached.
- Use Case: Fine-tune a visual-qa analysis prompt to consistently find spacing and layout defects across a library of mobile screenshots before deploying to production QA pipelines.
Quick Start
Run the visual-qa trainer to evaluate the visual-qa prompt against the training test cases and iterate until pass criteria are met.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: train-visual-qa Download link: https://github.com/thomasttvo/claude-skills/archive/main.zip#train-visual-qa Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.