Cekura Eval Design

Official

Design and run AI voice agent tests.

Authorcekura-ai
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill streamlines the creation, execution, and analysis of test scenarios (evaluators) for AI voice agents, ensuring comprehensive coverage and quality.

Core Features & Use Cases

  • Eval Design: Create detailed test scenarios with specific instructions, expected outcomes, and test profiles.
  • Test Infrastructure: Set up and manage test profiles for realistic caller data.
  • Execution & Analysis: Run tests in various modes (voice, text, WebSocket) and analyze results.
  • Use Case: You need to test your new AI agent's ability to handle appointment cancellations. Use this Skill to design a scenario where the simulated caller attempts to cancel, provides verification details, and the agent's response is evaluated against predefined success criteria.

Quick Start

Use the Cekura Eval Design skill to create a new evaluator for testing agent appointment booking.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: Cekura Eval Design
Download link: https://github.com/cekura-ai/claude-skills/archive/main.zip#cekura-eval-design

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.