data-cleaning
CommunityTransform messy data into insights.
Authorseb1n
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill tackles the pervasive issue of messy, inconsistent, and incomplete data, transforming raw datasets into a reliable format ready for analysis and machine learning.
Core Features & Use Cases
- Handles Missing Values: Imputes or flags missing data based on column type and missingness patterns.
- Deduplication: Identifies and removes exact and near-duplicate records.
- Type Coercion & Standardization: Corrects data types and standardizes formats (dates, strings, numbers).
- Outlier Detection & Treatment: Identifies and handles outliers using statistical methods.
- Schema Validation: Enforces data quality rules to ensure consistency.
- Use Case: Clean a customer database with missing emails, duplicate entries, and inconsistent region names before running a marketing campaign.
Quick Start
Use the data-cleaning skill to clean the attached file 'customer_data.csv' and save the output to 'cleaned_customer_data.csv'.
Dependency Matrix
Required Modules
pandaspyjanitorgreat_expectationsfuzzywuzzynumpy
Components
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: data-cleaning Download link: https://github.com/seb1n/awesome-ai-agent-skills/archive/main.zip#data-cleaning Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.