Skill Explorer

Searching protocol for "performance evaluation"

llm-evaluation

Community

Measure and improve LLM performance.

Advanced

byDrLuggels

evaluator

Community

Systematically validate AI performance while you rest.

Advanced

byoimiragieo

eval-performance

Official

Optimize MSBuild build evaluation.

Few Config

bydotnet

eval-model

Community

Evaluate and compare model performance

Advanced

bymaminul007

agent-evaluation

Community

Measure and improve agent performance.

Advanced

byeyadsibai

evaluation

Community

Measure agent performance with multi-dim rubric.

Advanced

byAsmayaseen

llm-evaluation

Community

Evaluate LLM performance rigorously.

Advanced

bycuoreinpace

Agent Evaluation

Community

Score agent performance

Few Config

bycdalsoniii

evaluation

Official

Quantify agent performance with robust evaluation

Advanced

byShakudo-io

elevenlabs-evaluator

Community

Evaluate ElevenLabs voice quality

Few Config

byMythologIQ

performance-evaluate-questions

Community

AI-powered performance review summaries

Few Config

bydmzoneill

nemo-evaluator-sdk

Community

Benchmark LLMs at scale.

Advanced

byihatesea69