Search results for "llm-as-judge"
Implement tasks with LLM-as-Judge verification.
Design LLM-as-Judge evaluators.
Subjective quality evaluation with LLMs.
Rigorous agent testing and validation.
Optimize AI image prompts.
Build and run robust AI evaluations.
Iteratively evaluate and refine AI agent outputs.
Measure and improve LLM performance.
Build and run LangSmith evaluations.
Evaluate GenAI agents with MLflow.
Build and deploy LangSmith evaluators.
Evaluate LLM application performance.
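The entries above all revolve around the same core pattern: prompt a judge model with a rubric, then parse a structured verdict from its reply. A minimal sketch of that pattern follows; every name here is hypothetical, and the model call is stubbed so the example is self-contained rather than tied to any particular provider.

```python
import re

# Hypothetical rubric prompt; a real evaluator would tune this wording.
JUDGE_PROMPT = """You are an impartial judge. Rate the answer below on a
1-5 scale for helpfulness and accuracy, then end with 'Score: N'.

Question: {question}
Answer: {answer}"""


def call_llm(prompt: str) -> str:
    # Stub standing in for a real model API call (e.g. via an SDK client).
    return "The answer is concise and correct. Score: 4"


def judge(question: str, answer: str) -> int:
    """Ask the judge model for a 1-5 score and parse it from the reply."""
    reply = call_llm(JUDGE_PROMPT.format(question=question, answer=answer))
    match = re.search(r"Score:\s*([1-5])", reply)
    if match is None:
        raise ValueError(f"unparseable judge reply: {reply!r}")
    return int(match.group(1))


score = judge("What is 2 + 2?", "4")
```

In practice the parsing step is the fragile part: constraining the judge to a fixed output format (or to structured/JSON output where the API supports it) and failing loudly on unparseable replies, as above, keeps bad verdicts out of aggregate metrics.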