Name: genie-benchmark-generator
Availability: InStock
Author: databricks-solutions

System Documentation

What problem does it solve?

This Skill automates the creation, validation, and synchronization of benchmark questions for Genie Space optimization, ensuring robust evaluation of AI agent performance.

Core Features & Use Cases

Multi-path Intake: Handles user-provided questions (10+, 1-9), or generates synthetic benchmarks from scratch.
Ground Truth Validation: Executes SQL queries against the live warehouse to verify correctness and store results.
MLflow Integration: Syncs validated benchmarks to MLflow Evaluation Datasets for seamless integration with GenAI evaluation workflows.
Use Case: Before optimizing a Genie Space for cost analysis, use this Skill to generate a comprehensive set of benchmark questions, validate their expected SQL, and prepare them for MLflow evaluation, ensuring the optimization loop has reliable test cases.

Quick Start

Use the genie-benchmark-generator skill to create and validate benchmarks for the 'cost' domain in the 'main.genie_benchmarks' Unity Catalog schema.

Please help me install this Skill: Name: genie-benchmark-generator Download link: https://github.com/databricks-solutions/vibe-coding-workshop-template/archive/main.zip#genie-benchmark-generator Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

genie-benchmark-generator

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper