ai-performance-testing

Official

Measure AI factual accuracy & consistency

Author: DTMC-marketplace
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This skill provides an objective way to measure and improve the reliability of AI systems, checking that their answers are accurate, complete, and consistent.

Core Features & Use Cases

  • Factual Accuracy Measurement: Quantifies how often an AI's responses are factually correct based on provided context, with a target of over 95%.
  • Completeness Evaluation: Assesses if the AI's answers fully address the user's query and incorporate relevant information from the context, aiming for over 90% completeness.
  • Consistency Scoring: Checks whether the AI's responses stay consistent across different queries, targeting a score of over 85%.
  • Use Case: A company deploying a customer support chatbot can use this skill to rigorously test its accuracy and completeness before launch, preventing the dissemination of incorrect information and ensuring a positive user experience.
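The three targets above can be turned into a simple pass/fail check. The following is a minimal sketch, not the skill's own implementation: the GradedResponse record, the score and meets_targets functions, and the way consistency is computed (agreement among answers to paraphrases of the same question) are all assumptions made for illustration. Only the three thresholds come from the feature list.

```python
from collections import Counter
from dataclasses import dataclass

# Thresholds quoted in the feature list; everything else here is hypothetical.
TARGETS = {"accuracy": 0.95, "completeness": 0.90, "consistency": 0.85}


@dataclass
class GradedResponse:
    """One graded AI response (hypothetical record shape)."""
    query_group: str     # paraphrases of the same question share a group
    answer: str          # normalized answer text
    is_correct: bool     # grader judged the answer factually correct
    facts_covered: int   # expected facts that appear in the answer
    facts_expected: int  # expected facts for this query (must be > 0)


def score(responses):
    """Return accuracy, completeness, and consistency as fractions in [0, 1]."""
    if not responses:
        raise ValueError("need at least one graded response")
    n = len(responses)
    accuracy = sum(r.is_correct for r in responses) / n
    completeness = sum(r.facts_covered / r.facts_expected for r in responses) / n

    # Assumed definition: within each group, the share of answers that match
    # the most common answer, averaged over groups.
    groups = {}
    for r in responses:
        groups.setdefault(r.query_group, []).append(r.answer)
    agreement = [
        Counter(answers).most_common(1)[0][1] / len(answers)
        for answers in groups.values()
    ]
    consistency = sum(agreement) / len(agreement)

    return {
        "accuracy": accuracy,
        "completeness": completeness,
        "consistency": consistency,
    }


def meets_targets(scores):
    """Compare each score with its target ("over" is read as strictly greater)."""
    return {name: scores[name] > target for name, target in TARGETS.items()}
```

Calling meets_targets(score(responses)) returns one boolean per metric, which makes it easy to block a launch when any target is missed.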

Quick Start

Use the ai-performance-testing skill to generate test data for evaluating an AI system.
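This page does not document the format of the generated test data. As a rough illustration only, test cases for a customer support chatbot might look like the records built below. The field names (id, query_group, context, query, expected_facts) and the build_test_cases and to_jsonl helpers are hypothetical, and the refund-policy text is made-up sample data.

```python
import json


def build_test_cases(context, items):
    """Expand (query, paraphrases, expected_facts) tuples into flat test cases.

    Every paraphrase becomes its own case but keeps the same query_group,
    so answers to the same underlying question can be compared later.
    """
    cases = []
    for i, (query, paraphrases, expected_facts) in enumerate(items, start=1):
        for j, text in enumerate([query, *paraphrases]):
            cases.append({
                "id": f"case-{i}-{j}",
                "query_group": f"group-{i}",
                "context": context,
                "query": text,
                "expected_facts": list(expected_facts),
            })
    return cases


def to_jsonl(cases):
    """Serialize test cases as JSON Lines, one case per line."""
    return "\n".join(json.dumps(case, ensure_ascii=False) for case in cases)


# Illustrative sample data, not taken from the skill.
sample_cases = build_test_cases(
    "Refunds are accepted within 30 days of purchase.",
    [(
        "How long do I have to return an item?",
        ["What is the refund window?"],
        ["30 days"],
    )],
)
```

Writing to_jsonl(sample_cases) to a file gives one self-contained record per line, which is convenient for feeding cases to the system under test one at a time.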

Dependency Matrix

Required Modules

None required

Components

scripts, references

💻 Claude Code Installation

Recommended: Let Claude install the skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: ai-performance-testing
Download link: https://github.com/DTMC-marketplace/governance/archive/main.zip#ai-performance-testing

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
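
For readers who prefer to do the extraction step by hand, the following is a minimal sketch of what it might look like once the .zip file has been downloaded. It assumes the archive contains a folder named ai-performance-testing somewhere inside it; the archive layout is not confirmed by this page, and install_skill_from_zip is a hypothetical helper, not part of the skill.

```python
import zipfile
from pathlib import Path, PurePosixPath

SKILL_NAME = "ai-performance-testing"


def install_skill_from_zip(zip_path, skills_dir=".claude/skills", skill_name=SKILL_NAME):
    """Copy the skill's folder out of a downloaded archive into skills_dir.

    Assumption: the skill lives in a folder called skill_name somewhere in
    the archive. Files outside that folder are ignored.
    """
    target_root = Path(skills_dir) / skill_name
    extracted = 0
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            parts = PurePosixPath(info.filename).parts
            if info.is_dir() or skill_name not in parts:
                continue
            # Keep only the path below the skill folder.
            relative = parts[parts.index(skill_name) + 1:]
            if not relative or ".." in relative:
                continue  # skip odd entries and path-traversal attempts
            destination = target_root.joinpath(*relative)
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(archive.read(info))
            extracted += 1
    if extracted == 0:
        raise FileNotFoundError(f"no '{skill_name}' folder found in {zip_path}")
    return target_root
```

For example, install_skill_from_zip("main.zip") would place the files under .claude/skills/ai-performance-testing/ relative to the current directory, or raise FileNotFoundError if the assumed folder is not present.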
