LangSmith Datasets
OfficialCreate and manage LangSmith evaluation datasets.
AuthorDiploma-pending
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill streamlines the process of creating, managing, and uploading evaluation datasets for testing and validation, directly from exported trace files.
Core Features & Use Cases
- Dataset Generation: Automatically create datasets in various formats (final_response, single_step, trajectory, RAG) from LangSmith trace exports.
- LangSmith Upload: Seamlessly upload generated datasets to your LangSmith workspace for organized evaluation.
- Use Case: After running experiments, export your traces and use this Skill to generate a
final_responsedataset, then upload it to LangSmith as "My Experiment Results" for easy comparison and analysis.
Quick Start
Use the langsmith-datasets skill to generate a final_response dataset from the traces in the './traces' directory and upload it to LangSmith with the name 'My Skill Datasets'.
Dependency Matrix
Required Modules
langsmithclickrichpython-dotenvcommanderchalkcli-table3dotenv
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: LangSmith Datasets Download link: https://github.com/Diploma-pending/test-case/archive/main.zip#langsmith-datasets Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.