Name: mcs-eval
Availability: InStock
Author: microsoft

System Documentation

What problem does it solve?

This Skill automates the evaluation of Copilot Studio agents, ensuring they meet quality and boundary standards before deployment. It identifies and reports on performance issues, enabling targeted fixes.

Core Features & Use Cases

Automated Testing: Runs predefined evaluation sets against your agent using Direct Line API or MCS Native Eval.
Mode Flexibility: Supports both fast, automated Direct Line testing and robust MCS Native Eval for agents with complex tools.
Detailed Reporting: Writes test results directly into brief.json for dashboard visibility and generates CSVs for reference.
Use Case: After building a new customer support agent, use this Skill to run a suite of tests covering common queries, edge cases, and quality metrics. The results will highlight any areas where the agent fails to provide accurate or appropriate responses.

Quick Start

Run all evaluation sets for the project 'MyProject' and agent 'MyAgent'.

Please help me install this Skill: Name: mcs-eval Download link: https://github.com/microsoft/MCS-Agent-Builder/archive/main.zip#mcs-eval Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

mcs-eval

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper