run-evaluation

Community

Post-run analysis to improve agent performance.

AuthorKjdragan
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill performs a post-mortem analysis of the latest Universal Agent run by inspecting run.log, session outputs, and Logfire traces to identify errors, bottlenecks, and opportunities for improvement.

Core Features & Use Cases

  • Automated post-mortem analysis: Analyze run.log, session outputs, and Logfire traces to surface actionable insights.
  • Problem identification: Detect anomalies, exceptions, and deviations from a happy path.
  • Recommendations: Generate concrete improvement suggestions for future runs, tuning, and debugging.

Quick Start

Run the evaluation workflow after an agent run completes using the steps described in the Skill guide.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: run-evaluation
Download link: https://github.com/Kjdragan/universal_agent/archive/main.zip#run-evaluation

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.