Skill Explorer

Searching protocol for "data evaluation"

mlflow-evaluation

Official

MLflow GenAI evaluation workflows for agents.

Advanced

bydatabricks-solutions

databricks-mlflow-evaluation

Community

End-to-end GenAI evaluation with MLflow.

Advanced

byandregit2026

fiftyone-model-evaluation

Official

Evaluate model predictions against ground truth.

Few Config

byvoxel51

evaluation-metrics

Community

Rigorous, reproducible LLM evaluation.

Advanced

byricardoroche

eval

Official

Orchestrate robust AI evaluations with EvalKit.

Advanced

byHappyverse-Team

mlflow-evaluation

Community

MLflow GenAI evaluation for quality.

Advanced

bydatasciencemonkey

databricks-mlflow-evaluation

Community

End-to-end MLflow GenAI evaluation for Databricks.

Advanced

bymirakui

langsmith-evaluator

Official

Build scalable, code-driven LangSmith evaluators.

Advanced

bylangchain-ai

databricks-mlflow-evaluation

Official

GenAI evaluation with MLflow metrics

Advanced

bymkgs-databricks-demos

eval-engine

Community

LLM evaluation pipeline

Advanced

bymqzkim

huggingface-evaluate

Official

Evaluate ML models & datasets

Few Config

byDTMC-marketplace

langfuse-dataset-setup

Community

Set up Langfuse datasets & evaluations

Few Config

bymberto10