Search results for "LLM as judge"
Build LLM evaluators for quality assessment.
Calibrate LLM judges against human labels.
Design LLM-as-Judge evaluators.
Design LLM judges for subjective criteria.
LLM evaluation with automated benchmarks.
Implement tasks with LLM-as-Judge verification.
Make LLM judgments reliable with proven methods.
Evaluate LLM outputs with AI judges.
Measure LLM quality with rigorous evaluation.
Evaluate GenAI agents with MLflow
LLM evaluation with automated metrics.
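Several of the results above mention calibrating LLM judges against human labels. A minimal sketch of that calibration step is shown below: given a set of judge verdicts and human labels for the same examples, compute raw agreement and chance-corrected agreement (Cohen's kappa). The `judge_labels` values here are hypothetical; in practice they would come from prompting an LLM judge on each example.

```python
# Sketch: calibrating an LLM judge against human labels.
# judge_labels is hypothetical judge output (1 = pass, 0 = fail);
# in a real pipeline it would come from an LLM judging each example.

def agreement_rate(judge, human):
    """Fraction of examples where the judge matches the human label."""
    assert len(judge) == len(human) and judge
    return sum(j == h for j, h in zip(judge, human)) / len(judge)

def cohens_kappa(judge, human):
    """Chance-corrected agreement between judge and human labels."""
    n = len(judge)
    po = agreement_rate(judge, human)
    # Expected agreement by chance, from each rater's label frequencies.
    pe = sum(
        (sum(1 for j in judge if j == lbl) / n)
        * (sum(1 for h in human if h == lbl) / n)
        for lbl in set(judge) | set(human)
    )
    return (po - pe) / (1 - pe) if pe != 1 else 1.0

judge_labels = [1, 1, 0, 1, 0, 1, 0, 0]  # hypothetical judge verdicts
human_labels = [1, 1, 0, 0, 0, 1, 1, 0]  # hypothetical human labels
print(agreement_rate(judge_labels, human_labels))           # 0.75
print(round(cohens_kappa(judge_labels, human_labels), 3))   # 0.5
```

A common rule of thumb is to trust the judge for automated evaluation only once kappa against a held-out human-labeled set is acceptably high; raw agreement alone can be inflated when one label dominates.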