validate-rubrics
Official
Harden evaluation rubrics against failure.
Author: Ankh-Studio
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill targets evaluation rubrics that are unreliable or open to exploitation and that can therefore produce incorrect, incomplete, or biased results. It finds these weaknesses before the rubric is put to use and proposes fixes for them.
Core Features & Use Cases
- Adversarial Stress-Testing: Simulates edge cases and attack vectors to uncover rubric vulnerabilities.
- Vulnerability Analysis: Identifies specific ways a rubric might fail, rating severity and likelihood.
- Revision and Validation: Proposes concrete improvements and validates the hardened rubric against identified edge cases.
- Use Case: A team developing an AI model for content moderation uses this Skill to ensure their rubric for evaluating harmful content is robust and doesn't miss nuanced cases or produce biased classifications.
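The severity-and-likelihood rating mentioned under Vulnerability Analysis can be pictured with a small sketch. The Skill's actual rating scheme is not documented here, so the three-level scale, the `Vulnerability` class, the `prioritize` helper, and the sample findings below are all hypothetical and serve only to illustrate the idea of ranking rubric weaknesses by risk.

```python
from dataclasses import dataclass

# Hypothetical three-level scale; the Skill's real scale is an unknown.
LEVELS = {"low": 1, "medium": 2, "high": 3}


@dataclass
class Vulnerability:
    """One way a rubric might fail (illustrative structure only)."""
    description: str
    severity: str    # "low", "medium", or "high"
    likelihood: str  # "low", "medium", or "high"

    @property
    def risk(self) -> int:
        # Simple severity x likelihood product, assumed for illustration.
        return LEVELS[self.severity] * LEVELS[self.likelihood]


def prioritize(findings):
    """Return findings sorted from highest to lowest risk."""
    return sorted(findings, key=lambda v: v.risk, reverse=True)


# Made-up findings for a content-moderation rubric.
findings = [
    Vulnerability("'Harmful' is undefined for satire", "high", "medium"),
    Vulnerability("No guidance for empty responses", "low", "high"),
    Vulnerability("Longer answers score higher regardless of accuracy", "high", "high"),
]

for v in prioritize(findings):
    print(v.risk, v.description)
```

Sorting by a single risk score keeps the revision step focused on the weaknesses that are both damaging and likely to occur; a real workflow might weight the two factors differently.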
Quick Start
Use the validate-rubrics skill to stress-test the rubric file located at rubrics/prompt.md.
Dependency Matrix
Required Modules
None required
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: validate-rubrics
Download link: https://github.com/Ankh-Studio/copilot-enterprise-eval-plugin/archive/main.zip#validate-rubrics
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
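For a manual install, the extract-and-copy step might look like the sketch below. The archive's internal layout is not documented here, so the code assumes that the downloaded zip contains a folder named `validate-rubrics` somewhere in its tree; the `install_skill` function and the local file name `main.zip` are hypothetical.

```python
import zipfile
from pathlib import Path, PurePosixPath


def install_skill(zip_path, skill_name, skills_dir):
    """Copy files found under a folder named `skill_name` inside the archive
    into skills_dir/skill_name. Returns the number of files written.

    Assumption: the archive contains such a folder; its real layout is unknown.
    """
    target_root = Path(skills_dir) / skill_name
    written = 0
    with zipfile.ZipFile(zip_path) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            # Only keep files that sit below a directory named skill_name.
            if skill_name not in parts[:-1]:
                continue
            rel = parts[parts.index(skill_name) + 1:]
            if ".." in rel:
                continue  # skip suspicious paths
            dest = target_root.joinpath(*rel)
            dest.parent.mkdir(parents=True, exist_ok=True)
            dest.write_bytes(zf.read(info))
            written += 1
    return written


# Example call, assuming the archive was already saved as main.zip:
# install_skill("main.zip", "validate-rubrics", ".claude/skills")
```

A return value of 0 would mean that no folder with that name was found, in which case the archive should be inspected by hand before copying anything.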