golden-dataset-curation

Community

Automate golden-dataset curation with multi-agent QA.

Author: yonatangross
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill automates quality assessment of candidate documents for a golden dataset by running multiple agents over each document, so curation stays scalable and every inclusion decision is auditable.

Core Features & Use Cases

  • Content-type classification: automatically categorize incoming documents (article, tutorial, research_paper, etc.) to guide curation.
  • Quality assessment & consensus: run parallel agent evaluations on accuracy, coherence, depth, and relevance, then aggregate into a final decision.
  • Test query generation: produce retrieval test queries to validate coverage and retrieval performance.
  • End-to-end workflow: ingest new documents, run multi-agent quality checks, tag domains, and commit publish-ready entries when thresholds are met.
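The consensus step in the list above can be illustrated with a minimal sketch. It assumes each agent returns a score in [0, 1] for the four dimensions named above, and that the final decision is a plain average compared against two cutoffs. The threshold values, the `AgentScore` type, the `aggregate` function, and the "exclude" outcome are all hypothetical; the Skill's actual aggregation rule is not documented here.

```python
from dataclasses import dataclass
from statistics import mean

# Dimensions named in the feature list.
DIMENSIONS = ("accuracy", "coherence", "depth", "relevance")

# Hypothetical cutoffs, chosen only for illustration.
INCLUDE_THRESHOLD = 0.75
REVIEW_THRESHOLD = 0.55


@dataclass
class AgentScore:
    """One agent's evaluation of a document (hypothetical shape)."""
    agent: str
    scores: dict[str, float]  # dimension -> score in [0, 1]


def aggregate(evaluations: list[AgentScore]) -> dict:
    """Average each dimension across agents, then map the overall mean to a decision."""
    if not evaluations:
        raise ValueError("at least one agent evaluation is required")

    per_dimension = {
        dim: mean(e.scores[dim] for e in evaluations) for dim in DIMENSIONS
    }
    overall = mean(per_dimension.values())

    if overall >= INCLUDE_THRESHOLD:
        decision = "include"
    elif overall >= REVIEW_THRESHOLD:
        decision = "review"
    else:
        decision = "exclude"

    return {"per_dimension": per_dimension, "overall": overall, "decision": decision}


if __name__ == "__main__":
    result = aggregate([
        AgentScore("agent-a", {"accuracy": 0.9, "coherence": 0.8, "depth": 0.8, "relevance": 0.9}),
        AgentScore("agent-b", {"accuracy": 0.7, "coherence": 0.8, "depth": 0.6, "relevance": 0.9}),
    ])
    print(result["decision"], round(result["overall"], 2))
```

Keeping the per-dimension averages alongside the decision makes each outcome traceable to the scores that produced it, which fits the auditability goal stated earlier.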

Quick Start

Provide a new document URL or content snippet to seed the golden-dataset-curation workflow, then trigger the curator to classify the document, evaluate it, and produce a consensus-based inclusion decision. Use the Langfuse prompts and the consensus output to decide whether to include the document or send it for review.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: golden-dataset-curation
Download link: https://github.com/yonatangross/create-yg-app/archive/main.zip#golden-dataset-curation

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
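For a manual install, the same download, extract, and copy steps can be sketched in Python. This is a sketch under assumptions, not the official installer: it assumes the archive contains a directory named after the Skill somewhere in its tree (the `#golden-dataset-curation` fragment in the link suggests this, but the archive layout is not documented here), and `extract_skill` is a hypothetical helper name.

```python
import io
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(zip_bytes: bytes, skill_name: str, dest_root: Path) -> Path:
    """Copy every file found under a directory named `skill_name` in the
    archive into dest_root/skill_name and return that directory.

    Assumption: the skill lives in a folder named exactly `skill_name`.
    """
    target = dest_root / skill_name
    copied = 0
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            parts = PurePosixPath(info.filename).parts
            if info.is_dir() or skill_name not in parts:
                continue
            # Path of the entry relative to the skill folder.
            rest = parts[parts.index(skill_name) + 1:]
            if not rest or ".." in rest:
                continue  # skip non-folder matches and unsafe paths
            out = target.joinpath(*rest)
            out.parent.mkdir(parents=True, exist_ok=True)
            out.write_bytes(archive.read(info))
            copied += 1
    if copied == 0:
        raise FileNotFoundError(f"no files under '{skill_name}' in archive")
    return target
```

To use it, fetch the archive bytes from the download link above (for example with `urllib.request.urlopen(url).read()`) and call `extract_skill(data, "golden-dataset-curation", Path(".claude/skills"))`. The URL fragment is not sent to the server, so the whole repository archive is downloaded and the filtering happens locally.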
