Searching protocols for "Evaluate"
Build and run LangSmith evaluations.
Orchestrate end-to-end LLM app evaluations.
Build and run AI evaluators with Phoenix.
Auto re-evaluate attempts after changes.
Build and run robust AI evaluations.
LLM-based evaluation patterns for scale.
Automatic fixes for failed evaluations.
Automate evaluation-fix loops end-to-end.
Evaluate and optimize LLM agents.
Evaluate agents in production with robust scoring.
End-to-end GenAI evaluation with MLflow.
Build scalable, code-driven LangSmith evaluators.
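Several of the results above revolve around code-driven evaluators (the LangSmith entries in particular). As a rough illustration of what such an evaluator looks like, here is a minimal sketch using the LangSmith Python SDK's `evaluate` entry point; the dataset name, the `my_app` target function, and the `exact_match` metric are hypothetical placeholders, and a LangSmith API key is assumed to be configured in the environment.

```python
from langsmith.evaluation import evaluate

# Custom code evaluator: receives the run and its reference example,
# and returns a score dict that LangSmith records on the experiment.
def exact_match(run, example):
    predicted = (run.outputs or {}).get("answer", "")
    expected = (example.outputs or {}).get("answer", "")
    return {"key": "exact_match", "score": int(predicted == expected)}

# Hypothetical target under test: maps dataset inputs to app outputs.
def my_app(inputs: dict) -> dict:
    return {"answer": "42"}

results = evaluate(
    my_app,
    data="my-eval-dataset",        # name of an existing LangSmith dataset (placeholder)
    evaluators=[exact_match],
    experiment_prefix="exact-match-eval",
)
```

The same pattern scales to multiple evaluators by adding more scoring functions to the `evaluators` list; each one is plain Python, which is what makes these evaluations code-driven rather than prompt-only.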