trulens-running-evaluations

Official

Run and compare TruLens evaluations across apps.

Authortruera
Version1.0.0
Installs0

System Documentation

What problem does it solve?

TruLens users need a streamlined workflow to run, orchestrate, and compare evaluations across different app versions and configurations, collecting results for analysis and decision making.

Core Features & Use Cases

  • Orchestrates single and batch TruLens evaluations across wrappers like TruChain, TruGraph, and TruLlama.
  • Aggregates results, surfaces leaderboard insights, and supports ground-truth alignment for rigorous comparisons.
  • Use Case: Instrument an app, configure feedbacks, run evaluations across v1 and v2, and compare results side-by-side in a dashboard.

Quick Start

Use this skill to run TruLens evaluations and retrieve results for visualization.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: trulens-running-evaluations
Download link: https://github.com/truera/trulens/archive/main.zip#trulens-running-evaluations

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.