Search results for "judge prompt"
Design LLM-as-Judge evaluators.
Build LLM evaluators for quality assessment.
Optimize AI image prompts.
Design LLM judges for subjective criteria.
Calibrate LLM judges against human labels.
Scale LLM evaluation with bias-aware automation.
Quantify decision noise with independent juries.
Set up Langfuse datasets & evaluations.
Generate positive posts without judgment.
Turn model outputs into reliable judgments.
End-to-end MLflow GenAI evaluation for Databricks.