Skill Explorer

Search results for "llm evaluation"

advanced-evaluation

Community

Master LLM evaluation with AI judges.

Advanced

by rustams

llm-evaluation

Community

Automated and human evaluation for LLMs.

Advanced

by 48Nauts-Operator

llm-evaluation

Community

Master LLM evaluation for accurate, reliable AI apps.

Advanced

by camoneart

phoenix-evals

Official

Build and run AI evaluators with Phoenix.

Advanced

by Arize-ai

evaluating-llms-harness

Community

Benchmark LLM performance across academic tasks.

Few Config

by tianhao909

advanced-evaluation

Community

Build robust LLM evaluation systems.

Advanced

by boazcstrike

llm-evaluation

Community

Benchmark LLMs with automated evaluation pipelines.

Advanced

by karstenheld3

llm-evaluation

Community

Master LLM evaluation strategies.

Advanced

by Himanshu040604

evals-write-spec

Official

Author LLM evaluation specs.

Few Config

by elastic

advanced-evaluation

Community

Master LLM evaluation with robust, bias-free techniques.

Advanced

by northseadl

evaluating-llms-harness

Community

Benchmark LLMs against academic standards.

Advanced

by AXGZ21

llm_evaluation

Community

Benchmark and validate LLM performance.

Advanced

by vuralserhat86