data-cleaner
CommunityAutomate data cleaning and prep for modeling.
AuthorSPIRAL-EDWIN
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Data quality is foundational for effective modeling; messy CSV/Excel inputs cause biased results and wasted time. The Data-Cleaner standardizes loading, inspecting, and cleaning datasets to ensure reproducible inputs for analytics and modeling.
Core Features & Use Cases
- Load and inspect data from common formats (CSV, Excel, JSON) and show shape, dtypes, missing values, and basic statistics.
- Handle missing values with robust strategies (auto, median, forward-fill, interpolate) and drop heavily incomplete columns.
- Detect and cap or remove outliers using IQR or Z-score methods.
- Normalize numeric features with standard, min-max, or robust scaling.
- Fix data types (dates to datetime, categoricals) to improve downstream analysis.
- Produce a cleaned dataset (processed.csv) and a cleaning report (where the pipeline decisions are documented).
Quick Start
To start, run the cleaning pipeline on your raw data file: clean_data(filepath='data/raw_data.csv', output_path='data/processed.csv', normalize=False). Then review the generated processed.csv and processed_report.json to confirm improvements.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: data-cleaner Download link: https://github.com/SPIRAL-EDWIN/MCM-ICM-2601000/archive/main.zip#data-cleaner Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.