hugging-face-dataset-creator

Community

Create and manage Hugging Face datasets with ease

Author: Nymbo
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill streamlines creating and managing datasets on the Hugging Face Hub: initializing repos, storing configuration such as system prompts, and streaming in new rows that are validated against a template.

Core Features & Use Cases

  • Initialize datasets: Create repos with proper structure and README metadata.
  • Configure & prompt: Store detailed configuration including system prompts and metadata.
  • Stream updates: Add data rows efficiently with template-based validation.
  • Template support: Work with chat, classification, QA, completion, and tabular data templates.
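To make "template-based validation" concrete, here is a minimal sketch of what checking rows against a template could look like. It is not the Skill's actual implementation. Only the `qa` fields (`question`, `answer`) come from the Quick Start example; the `classification` fields and the names `REQUIRED_FIELDS` and `validate_rows` are assumptions made for illustration.

```python
import json

# Required fields per template. Only the "qa" fields are taken from the
# Quick Start example; the "classification" fields are a hypothetical guess.
REQUIRED_FIELDS = {
    "qa": ("question", "answer"),
    "classification": ("text", "label"),  # assumption, not from the Skill
}


def validate_rows(template, rows):
    """Return a list of error strings; an empty list means every row passed."""
    if template not in REQUIRED_FIELDS:
        return [f"unknown template: {template}"]
    errors = []
    for index, row in enumerate(rows):
        if not isinstance(row, dict):
            errors.append(f"row {index}: expected an object")
            continue
        for field in REQUIRED_FIELDS[template]:
            value = row.get(field)
            # Treat a missing key, a non-string value, and a blank string alike.
            if not isinstance(value, str) or not value.strip():
                errors.append(f"row {index}: missing or empty '{field}'")
    return errors


if __name__ == "__main__":
    rows = json.loads(
        '[{"question": "Sample?", "answer": "Sample answer."},'
        ' {"question": "No answer?"}]'
    )
    for problem in validate_rows("qa", rows):
        print(problem)
```

Collecting every error instead of stopping at the first one lets a caller reject a whole batch with a complete report before anything is uploaded.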

Quick Start

Examples:

  • Initialize a new dataset: python scripts/dataset_manager.py init --repo_id "your-username/dataset-name" [--private]
  • Add rows with validation: python scripts/dataset_manager.py add_rows --repo_id "your-username/dataset-name" --template qa --rows_json '[{"question": "Sample?", "answer": "Sample answer."}]'
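Hand-writing the `--rows_json` value gets error-prone once rows contain quotes or apostrophes. The sketch below builds the same `add_rows` command from Python data, using only the subcommand and flags shown above. The helper name `build_add_rows_command` is hypothetical and not part of the Skill.

```python
import json
import shlex


def build_add_rows_command(repo_id, template, rows):
    """Build the argument list for the add_rows command shown in Quick Start."""
    return [
        "python", "scripts/dataset_manager.py", "add_rows",
        "--repo_id", repo_id,
        "--template", template,
        # json.dumps handles escaping of quotes inside the row values.
        "--rows_json", json.dumps(rows),
    ]


if __name__ == "__main__":
    args = build_add_rows_command(
        "your-username/dataset-name",
        "qa",
        [{"question": "What's new?", "answer": 'A "quoted" answer.'}],
    )
    # shlex.join produces a string that is safe to paste into a POSIX shell.
    print(shlex.join(args))
```

Passing the list directly to `subprocess.run(args)` avoids shell quoting altogether; `shlex.join` is only needed when the command has to be shown or pasted as a single string.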

Dependency Matrix

Required Modules

huggingface_hub
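As a rough illustration of what the `init` step could do with this module, the sketch below creates a dataset repo through `HfApi.create_repo`. How `dataset_manager.py` actually does it is not shown in this listing, so the function name `init_dataset_repo` and the choice of `exist_ok=True` are assumptions. Running the example requires `huggingface_hub` to be installed and a valid Hub token to be configured.

```python
def init_dataset_repo(repo_id, private=False, api=None):
    """Create a dataset repo on the Hub (illustrative sketch, not the Skill's code).

    `api` can be any object exposing `create_repo`; when omitted, a real
    `huggingface_hub.HfApi` client is created.
    """
    if api is None:
        # Imported lazily so the function can be exercised with a stub client.
        from huggingface_hub import HfApi
        api = HfApi()
    return api.create_repo(
        repo_id=repo_id,
        repo_type="dataset",  # datasets live in their own repo namespace
        private=private,
        exist_ok=True,        # assumption: re-running init should not fail
    )


if __name__ == "__main__":
    # Performs a real network call against the Hugging Face Hub.
    print(init_dataset_repo("your-username/dataset-name", private=True))
```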

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: hugging-face-dataset-creator
Download link: https://github.com/Nymbo/Skills/archive/main.zip#hugging-face-dataset-creator

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
