deepeval

Community

Evaluate LLM applications with precision and scale.

Author: sammcj
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill provides comprehensive evaluation capabilities for LLM applications, ensuring reliability and performance across RAG systems, conversational AI, and agent workflows.

Core Features & Use Cases

  • 50+ Evaluation Metrics: Covering RAG pipelines, conversational AI, agents, safety, and custom criteria.
  • Component-Level Tracing: Use the @observe decorator to evaluate individual components of your LLM system.
  • Use Case: Imagine you've built a customer support chatbot. Use this Skill to automatically test its response quality, role adherence, and safety across diverse customer scenarios.

Quick Start

Use the deepeval skill to evaluate the response quality of your customer support chatbot against common queries and edge cases.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: let Claude install it automatically. Copy and paste the text below into Claude Code.

Please help me install this Skill:
Name: deepeval
Download link: https://github.com/sammcj/agentic-coding/archive/main.zip#deepeval

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
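If you prefer to perform the steps yourself, they amount to downloading the repository archive, extracting it, and copying the skill into `.claude/skills/`. A rough sketch, assuming the skill lives in a `deepeval` directory inside the archive (the exact extracted path is not confirmed by this listing):

```shell
# Manual install sketch: download, extract, and place the skill.
# The extracted directory name is an assumption based on GitHub's
# usual <repo>-<branch> archive layout.
mkdir -p .claude/skills
curl -L -o /tmp/agentic-coding.zip \
  "https://github.com/sammcj/agentic-coding/archive/main.zip"
unzip -q /tmp/agentic-coding.zip -d /tmp
cp -r /tmp/agentic-coding-main/deepeval .claude/skills/deepeval
```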