Search results for "human-eval"
Automated and human evaluation for LLMs.
Validate LLM performance, ensure quality.
Quantify LLMs with robust metrics.
Measure and improve LLM performance.
Benchmark and validate LLM performance.
Measure and improve LLM performance.
Evaluate LLM performance rigorously.
Quantify agent performance with scalable evaluation.
LLM evaluation with metrics and benchmarks.