add-dataset

Official

Easily add new datasets to AReaL.

AuthorinclusionAI
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill helps data engineers and ML engineers add new dataset loaders to AReaL, standardizing the process of integrating diverse data sources for SFT and RL workflows.

Core Features & Use Cases

  • Template-based scaffolding for creating areal/dataset/<name>.py loaders and updates to areal/dataset/init.py.
  • Dual-mode support for SFT and RL datasets, including required fields and processing steps.
  • Validation guidance and testing scaffolds to ensure the loaders produce HuggingFace Datasets with expected schema.

Quick Start

Use the add-dataset skill to scaffold a new dataset loader and register it in the areal project.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: add-dataset
Download link: https://github.com/inclusionAI/AReaL/archive/main.zip#add-dataset

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.