Skill Explorer

Searching protocol for "evaluator-pipelines"

DeepAgent

Community

Validate AI research, ensure robust insights.

No Config

by starwreckntx

llm-evaluation

Community

Benchmark LLMs with automated evaluation pipelines

Advanced

by karstenheld3

genai-agents-setup

Community

Build production-ready GenAI agents on Databricks.

Advanced

by prashsub

eval

Community

Evaluate AI agents systematically.

Advanced

by Parth576

eval-engine

Community

LLM evaluation pipeline

Advanced

by mqzkim

llmops-operations

Community

Optimize LLM apps: design, deploy, evaluate.

Advanced

by take566

openjudge

Official

Build LLM evaluation pipelines.

Advanced

by agentscope-ai

advanced-evaluation

Official

Reliable LLM evaluation with bias mitigation.

Advanced

by Shakudo-io

eval-audit

Community

Audit LLM evals for trust.

Advanced

by hamelsmu

advanced-evaluation

Community

Make LLM judgments reliable with proven methods.

Advanced

by samvanme

advanced-evaluation

Community

Production-grade evaluation patterns for LLMs.

Advanced

by rohunvora

eval-audit

Community

Audit LLM evals for trust and impact.

Advanced

by marchatton