data-cleaner

Community

Automate data cleaning and prep for modeling.

AuthorSPIRAL-EDWIN
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Data quality is foundational for effective modeling; messy CSV/Excel inputs cause biased results and wasted time. The Data-Cleaner standardizes loading, inspecting, and cleaning datasets to ensure reproducible inputs for analytics and modeling.

Core Features & Use Cases

  • Load and inspect data from common formats (CSV, Excel, JSON) and show shape, dtypes, missing values, and basic statistics.
  • Handle missing values with robust strategies (auto, median, forward-fill, interpolate) and drop heavily incomplete columns.
  • Detect and cap or remove outliers using IQR or Z-score methods.
  • Normalize numeric features with standard, min-max, or robust scaling.
  • Fix data types (dates to datetime, categoricals) to improve downstream analysis.
  • Produce a cleaned dataset (processed.csv) and a cleaning report (where the pipeline decisions are documented).

Quick Start

To start, run the cleaning pipeline on your raw data file: clean_data(filepath='data/raw_data.csv', output_path='data/processed.csv', normalize=False). Then review the generated processed.csv and processed_report.json to confirm improvements.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: data-cleaner
Download link: https://github.com/SPIRAL-EDWIN/MCM-ICM-2601000/archive/main.zip#data-cleaner

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.