Skill Explorer

Searching protocol for "graders"

redteam-plugin-development

Official

Standards for redteam plugins and graders.

Advanced

bypromptfoo

eval-harness

Community

Eval-driven testing for reliable Claude Code.

Advanced

byEotel

code-quality-grader

Community

Automated code quality grader for CI reviews.

Few Config

byerhankaraarslan

anthropic-evaluations

Community

Design and run robust AI agent evaluations.

No Config

bydwmkerr

eval-harness

Community

Formalize AI development with evals.

Advanced

bybbaserdem

eval-harness

Community

Rigorous evaluation framework for AI features.

Advanced

byBenjaminRose805

Evals

Community

Objective eval metrics via code/model/human graders

Advanced

byRooseveltAdvisors

evals

Community

Plan, run, and analyze AI evals.

Advanced

bycamronh

eval-harness

Community

Formal eval framework for Claude Code sessions.

Advanced

byzzh0u

eval

Community

Define, run, and report evals before coding.

Few Config

byfancive

ai-color-grader

Community

Color-grade videos with anime-style presets.

Few Config

byLutra23

pedagogical-code-grader

Community

Grade code by concept mastery, not only accuracy.

Few Config

byjorgealves