LangSmith Datasets

Official

Create and manage LangSmith evaluation datasets.

AuthorDiploma-pending
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill streamlines the process of creating, managing, and uploading evaluation datasets for testing and validation, directly from exported trace files.

Core Features & Use Cases

  • Dataset Generation: Automatically create datasets in various formats (final_response, single_step, trajectory, RAG) from LangSmith trace exports.
  • LangSmith Upload: Seamlessly upload generated datasets to your LangSmith workspace for organized evaluation.
  • Use Case: After running experiments, export your traces and use this Skill to generate a final_response dataset, then upload it to LangSmith as "My Experiment Results" for easy comparison and analysis.

Quick Start

Use the langsmith-datasets skill to generate a final_response dataset from the traces in the './traces' directory and upload it to LangSmith with the name 'My Skill Datasets'.

Dependency Matrix

Required Modules

langsmithclickrichpython-dotenvcommanderchalkcli-table3dotenv

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: LangSmith Datasets
Download link: https://github.com/Diploma-pending/test-case/archive/main.zip#langsmith-datasets

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.