Skill Explorer

Searching protocol for "BLEU"

llm-evaluation

Community

Automated and human evaluation for LLMs.

Advanced

by48Nauts-Operator

llm-evaluation

Community

Measure and improve LLM performance.

Advanced

byDrLuggels

llm_evaluation

Community

Benchmark and validate LLM performance.

Advanced

byvuralserhat86

llm-eval

Community

Measure and improve LLM performance.

Advanced

byLuisSambrano

llm-evaluation

Community

Master LLM evaluation strategies.

Advanced

byTriNgo0108

llm-evaluation

Community

Evaluate LLM applications rigorously.

Advanced

bywshobson

llm-evaluation

Community

Master LLM evaluation strategies.

Advanced

bydrgaciw

evaluation-metrics

Community

Measure and improve LLM quality.

Advanced

bypluginagentmarketplace

llm-evaluation

Official

Measure LLM quality with rigorous evaluation.

Advanced

byaisa-group

llm-evaluation

Community

Measure and improve LLM performance.

Advanced

byas4584

llm-evaluation

Community

Measure and improve LLM performance.

Advanced

byHermeticOrmus

llm-evaluation

Community

Evaluate LLM application performance.

Advanced

byamurata