ai-performance-testing
Official
Measure AI factual accuracy & consistency
Software Engineering #consistency #ai testing #llm evaluation #completeness #performance metrics #factual accuracy #deepeval
Author: DTMC-marketplace
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill addresses the critical need to objectively measure and improve the reliability of AI systems, ensuring they provide accurate, complete, and consistent information.
Core Features & Use Cases
- Factual Accuracy Measurement: Quantifies how often an AI's responses are factually correct based on provided context, with a target of over 95%.
- Completeness Evaluation: Assesses if the AI's answers fully address the user's query and incorporate relevant information from the context, aiming for over 90% completeness.
- Consistency Scoring: Verifies that the AI maintains a high degree of consistency in its responses across different queries, targeting over 85%.
- Use Case: A company deploying a customer support chatbot can use this skill to rigorously test its accuracy and completeness before launch, preventing the dissemination of incorrect information and ensuring a positive user experience.
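The three targets above can be checked mechanically once each test case has been scored. The following is a minimal sketch, not the skill's own implementation: the `CaseScores` type, the `summarize` function, and the choice to average per-case scores are assumptions made for illustration, and "over 95%" is read as strictly greater than 0.95.

```python
from dataclasses import dataclass
from statistics import mean

# Targets quoted in the feature list; "over" is read as strictly greater than.
TARGETS = {"factual_accuracy": 0.95, "completeness": 0.90, "consistency": 0.85}


@dataclass
class CaseScores:
    """Hypothetical per-test-case scores, each expected to lie in [0, 1]."""
    factual_accuracy: float
    completeness: float
    consistency: float


def summarize(cases):
    """Average each metric over all cases and compare it with its target."""
    if not cases:
        raise ValueError("at least one scored test case is required")
    report = {}
    for metric, target in TARGETS.items():
        avg = mean(getattr(case, metric) for case in cases)
        report[metric] = {"mean": avg, "target": target, "passed": avg > target}
    return report


# Example: two scored cases; the second one misses part of the expected answer.
example_report = summarize([
    CaseScores(factual_accuracy=1.0, completeness=1.0, consistency=1.0),
    CaseScores(factual_accuracy=1.0, completeness=0.5, consistency=1.0),
])
```

How the per-case scores are produced (for example by an LLM-based evaluator) is outside this sketch; it only shows the aggregation and threshold comparison.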
Quick Start
Use the ai-performance-testing skill to generate test data for evaluating an AI system.
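The listing does not document the format of the generated test data. As a rough sketch of what an evaluation set could look like, the example below assumes each test case pairs a query with its supporting context and an expected answer, stored as JSON Lines. The `EvalCase` type, its field names, and both helper functions are hypothetical.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class EvalCase:
    """Hypothetical test case; field names are assumptions, not the skill's schema."""
    query: str
    context: str
    expected_answer: str


def to_jsonl(cases):
    """Serialize test cases as JSON Lines, one case per line."""
    return "\n".join(json.dumps(asdict(case), ensure_ascii=False) for case in cases)


def from_jsonl(text):
    """Parse JSON Lines back into EvalCase objects, skipping blank lines."""
    return [EvalCase(**json.loads(line)) for line in text.splitlines() if line.strip()]


# Example: a single support-chatbot case with made-up content.
sample = [
    EvalCase(
        query="How long do refunds take?",
        context="Refunds are processed within 5 business days.",
        expected_answer="Refunds take up to 5 business days.",
    )
]
serialized = to_jsonl(sample)
```

Keeping the context alongside each query matters here because factual accuracy is measured against the provided context rather than against general knowledge.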
Dependency Matrix
Required Modules
None required
Components
scripts, references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: ai-performance-testing
Download link: https://github.com/DTMC-marketplace/governance/archive/main.zip#ai-performance-testing
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.