Skill Explorer

Searching protocol for "run evaluation"

trulens-evaluation-workflow

Official

Orchestrate end-to-end LLM app evaluations.

Advanced

bytruera

phoenix-evals

Official

Build and run AI evaluators with Phoenix.

Advanced

byArize-ai

langsmith-evaluator

Community

Build and run LangSmith evaluations.

Few Config

bydhar174

LangSmith Evaluators

Official

Build and run robust AI evaluations.

Few Config

bylangchain-ai

trulens-running-evaluations

Official

Run and compare TruLens evaluations across apps.

Few Config

bytruera

hugging-face-evaluation

Community

Publish and manage Hugging Face model evaluations.

Few Config

bypatchy631

hugging-face-evaluation-manager

Community

Add and manage evaluation results in model cards

Advanced

byNymbo

trulens-evaluation-setup

Official

Effortlessly configure TruLens evaluations.

Advanced

bytruera

evaluate-presets

Community

Test and validate Ralph's presets efficiently.

Advanced

bymikeyobrien

agentos-api-evals

Community

Manage AgentOS evaluations

Few Config

byajshedivy

eval-engine

Community

LLM evaluation pipeline

Advanced

bymqzkim

eval-check

Community

Verify agent quality automatically.

Advanced

byrosinbum