golden-dataset-curation
Community
Automate golden-dataset curation with multi-agent QA.
Author: yonatangross
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill automates the quality assessment and multi-agent curation of documents for a golden dataset, enabling scalable, auditable data curation.
Core Features & Use Cases
- Content-type classification: automatically categorize incoming documents (article, tutorial, research_paper, etc.) to guide curation.
- Quality assessment & consensus: run parallel agent evaluations on accuracy, coherence, depth, and relevance, then aggregate into a final decision.
- Test query generation: produce retrieval test queries to validate coverage and retrieval performance.
- End-to-end workflow: ingest new documents, run multi-agent quality checks, tag domains, and commit publish-ready entries when thresholds are met.
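The consensus step described above can be sketched as follows. This is a minimal illustration, not the Skill's actual implementation: the `AgentScore` type, the `aggregate` function, the 0-to-1 score scale, and both thresholds are assumptions introduced here. It averages each of the four dimensions across agents and maps the overall mean to a decision.

```python
from dataclasses import dataclass
from statistics import mean

# The four dimensions named in the feature list.
DIMENSIONS = ("accuracy", "coherence", "depth", "relevance")

# Hypothetical thresholds, chosen only for illustration.
INCLUDE_THRESHOLD = 0.75
REVIEW_THRESHOLD = 0.55


@dataclass
class AgentScore:
    """One agent's scores for a document (assumed scale: 0.0 to 1.0)."""
    agent: str
    scores: dict[str, float]


def aggregate(evaluations: list[AgentScore]) -> dict:
    """Average each dimension across agents, then map the overall mean to a decision."""
    if not evaluations:
        raise ValueError("at least one agent evaluation is required")
    per_dimension = {
        dim: mean(e.scores[dim] for e in evaluations) for dim in DIMENSIONS
    }
    overall = mean(per_dimension.values())
    if overall >= INCLUDE_THRESHOLD:
        decision = "include"
    elif overall >= REVIEW_THRESHOLD:
        decision = "review"
    else:
        decision = "exclude"
    return {"per_dimension": per_dimension, "overall": overall, "decision": decision}


evaluations = [
    AgentScore("agent_a", {"accuracy": 0.9, "coherence": 0.8, "depth": 0.7, "relevance": 0.8}),
    AgentScore("agent_b", {"accuracy": 0.8, "coherence": 0.8, "depth": 0.9, "relevance": 0.8}),
]
result = aggregate(evaluations)
print(result["decision"])
```

A plain mean is the simplest aggregation; a real curator might instead weight dimensions differently or require a minimum score on every dimension before inclusion.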
Quick Start
Provide a new document URL or content snippet to seed the golden-dataset-curation workflow, then trigger the curator to classify, evaluate, and generate a consensus-based inclusion decision. Use the Langfuse prompts and consensus outputs to guide the inclusion or review of the document.
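The flow in this Quick Start (classify, evaluate in parallel, decide) might be orchestrated roughly as below. Everything here is a hypothetical sketch: `curate`, its parameters, and the 0.75 threshold are not part of the Skill, and the classifier and evaluators are passed in as plain callables standing in for whatever agents the real workflow invokes.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable


def curate(
    document: str,
    classify: Callable[[str], str],
    evaluators: dict[str, Callable[[str], float]],
    threshold: float = 0.75,  # hypothetical inclusion threshold
) -> dict:
    """Classify a document, score it on each dimension concurrently, and decide."""
    if not evaluators:
        raise ValueError("at least one evaluator is required")
    content_type = classify(document)
    # Submit one evaluator per quality dimension so they run concurrently.
    with ThreadPoolExecutor(max_workers=len(evaluators)) as pool:
        futures = {dim: pool.submit(fn, document) for dim, fn in evaluators.items()}
        scores = {dim: fut.result() for dim, fut in futures.items()}
    overall = sum(scores.values()) / len(scores)
    decision = "include" if overall >= threshold else "review"
    return {"content_type": content_type, "scores": scores, "decision": decision}


# Toy stand-ins for real agents: a fixed label and constant scores.
outcome = curate(
    "How to build a retrieval pipeline, step by step.",
    classify=lambda doc: "tutorial",
    evaluators={
        "accuracy": lambda doc: 0.8,
        "coherence": lambda doc: 0.9,
        "depth": lambda doc: 0.7,
        "relevance": lambda doc: 0.8,
    },
)
print(outcome["content_type"], outcome["decision"])
```

Threads suit this sketch because agent calls are typically I/O-bound network requests; the stand-in lambdas simply return immediately.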
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: golden-dataset-curation
Download link: https://github.com/yonatangross/create-yg-app/archive/main.zip#golden-dataset-curation
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.