Search results for "LLM judges"
Calibrate LLM judges against human labels.
Build LLM evaluators for quality assessment.
Design LLM-as-Judge evaluators.
Design LLM judges for subjective criteria.
Make LLM judgments reliable with proven methods.
LLM evaluation with automated benchmarks.
Measure LLM quality with rigorous evaluation.
Evaluate LLM outputs with AI judges.
LLM evaluation with automated metrics.
Implement tasks with LLM-as-Judge verification.
Scale LLM evaluation with bias-aware automation.
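Several of the entries above center on calibrating LLM judges against human labels. A minimal sketch of what that calibration step typically involves, assuming a simple setup where both the judge and human raters assign categorical verdicts: compute chance-corrected agreement (Cohen's kappa) between the two label sequences. The label values and data below are illustrative, not from any specific protocol.

```python
from collections import Counter

def cohens_kappa(human: list[str], judge: list[str]) -> float:
    """Chance-corrected agreement between two label sequences."""
    assert human and len(human) == len(judge)
    n = len(human)
    # Observed agreement: fraction of items where judge matches human.
    p_o = sum(h == j for h, j in zip(human, judge)) / n
    # Expected agreement by chance, from each rater's label frequencies.
    hc, jc = Counter(human), Counter(judge)
    p_e = sum((hc[lab] / n) * (jc[lab] / n) for lab in set(hc) | set(jc))
    if p_e == 1.0:  # both raters use a single identical label
        return 1.0
    return (p_o - p_e) / (1 - p_e)

# Illustrative verdicts: 6 outputs rated by humans and by an LLM judge.
human = ["good", "bad", "good", "good", "bad", "good"]
judge = ["good", "bad", "bad", "good", "bad", "good"]
print(round(cohens_kappa(human, judge), 3))  # → 0.667
```

A kappa well below raw percent agreement signals that much of the apparent agreement is chance; calibration protocols typically set a kappa threshold the judge must clear before its labels are trusted at scale.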