Skill Explorer

Searching protocol for "judging"

run-judges

Official

Automate quality evaluation of plans and code.

Advanced

byclosedloop-ai

trace

Community

Trace judgments on the PoJ chain.

Advanced

byzeyxx

judge

Community

Dual-judge evaluation for task commits.

Advanced

bydcarmitage

validate-evaluator

Community

Calibrate LLM judges against human labels.

Advanced

byhamelsmu

write-judge-prompt

Community

Design LLM-as-Judge evaluators.

Few Config

bymarchatton

write-judge-prompt

Community

Design LLM judges for subjective criteria.

Few Config

byhamelsmu

aiconfig-online-evals

Official

Evaluate AI Configs with built-in judges.

Few Config

bylaunchdarkly-labs

judge

Community

25-dimension judgments for content quality.

Advanced

byzeyxx

sadd:do-and-judge

Community

Orchestrate tasks with judge verification.

Advanced

bydalawwa

validate-evaluator

Community

Calibrate LLM judges against human labels.

Advanced

bymarchatton

sadd-judge

Community

Evaluate work with an AI judge.

Advanced

byGamezar

llm-as-a-judge

Official

Build LLM evaluators for quality assessment.

Advanced

bymaragudk