cleaning-data
Official
Make data clean and analysis-ready.
Data & Analytics
#SQL #validation #deduplication #data-quality #data-pipeline #data-cleaning #outlier-detection
Author: tilmon-engineering
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides a structured approach to data quality remediation in DataPeeker sessions. It detects and systematically remediates duplicates, outliers, NULL values, and free-text fields that need categorization, producing analysis-ready datasets.
Core Features & Use Cases
- Automated data cleaning pipeline: detect and remediate duplicates, outliers, and inconsistent categories, while tracking rationale.
- Phase-guided workflow: supports a five-phase process (scope, detection, strategy, execution, verification) with an audit trail.
- Deployable in analysis sessions: integrates with importing-data and data-analytics workflows for reproducible results.
- Use Case: Before a guided-investigation or exploratory-analysis workflow, run cleaning-data to generate clean tables and quality reports ready for analysis.
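The detection phase listed above can be illustrated with a minimal sketch. It assumes the session data lives in a SQLite database, which this listing does not confirm. The function name `detect_issues`, the `orders` table, and its columns are hypothetical, and the 1.5 × IQR rule is one common outlier heuristic, not necessarily the one the Skill applies.

```python
import sqlite3
import statistics


def detect_issues(conn, table, key_cols, numeric_col):
    """Count duplicate rows, NULLs, and IQR outliers for one table.

    Hypothetical helper: table and column names are interpolated into SQL,
    so pass trusted identifiers only.
    """
    keys = ", ".join(key_cols)

    # Duplicates: rows beyond the first in each group sharing the same key.
    duplicate_rows = conn.execute(
        f"SELECT COALESCE(SUM(n - 1), 0) FROM ("
        f"SELECT COUNT(*) AS n FROM {table} GROUP BY {keys} HAVING COUNT(*) > 1)"
    ).fetchone()[0]

    # NULLs in the numeric column under inspection.
    null_values = conn.execute(
        f"SELECT COUNT(*) FROM {table} WHERE {numeric_col} IS NULL"
    ).fetchone()[0]

    # Outliers: values outside 1.5 * IQR of the quartiles (assumed rule).
    values = sorted(
        row[0]
        for row in conn.execute(
            f"SELECT {numeric_col} FROM {table} WHERE {numeric_col} IS NOT NULL"
        )
    )
    outliers = []
    if len(values) >= 4:
        q1, _, q3 = statistics.quantiles(values, n=4)
        iqr = q3 - q1
        low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
        outliers = [v for v in values if v < low or v > high]

    return {
        "duplicate_rows": duplicate_rows,
        "null_values": null_values,
        "outliers": outliers,
    }
```

A report like this is only the input to the strategy phase: whether to drop, cap, or keep a flagged value is a decision to record alongside its rationale.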
Quick Start
Execute the cleaning pipeline on the current dataset to produce clean_[table] tables for downstream analyses.
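As a rough illustration of what producing a clean_[table] table might involve, the sketch below deduplicates a table into a `clean_`-prefixed copy and returns row counts for the verification step. It assumes a SQLite backend and an ordinary rowid table. The function `build_clean_table` and the keep-first-row policy are hypothetical, not the Skill's documented behavior.

```python
import sqlite3


def build_clean_table(conn, table, key_cols):
    """Copy `table` into clean_<table>, keeping one row per key group.

    Hypothetical helper: identifiers are interpolated into SQL, so pass
    trusted table and column names only.
    """
    keys = ", ".join(key_cols)
    clean = f"clean_{table}"

    conn.execute(f"DROP TABLE IF EXISTS {clean}")
    # Keep the earliest-inserted row (lowest rowid) in each key group.
    conn.execute(
        f"CREATE TABLE {clean} AS SELECT * FROM {table} "
        f"WHERE rowid IN (SELECT MIN(rowid) FROM {table} GROUP BY {keys})"
    )

    # Verification: compare row counts before and after cleaning.
    before = conn.execute(f"SELECT COUNT(*) FROM {table}").fetchone()[0]
    after = conn.execute(f"SELECT COUNT(*) FROM {clean}").fetchone()[0]
    return {
        "table": clean,
        "rows_before": before,
        "rows_after": after,
        "rows_removed": before - after,
    }
```

Writing to a new table rather than updating in place leaves the raw data untouched, so the cleaning step can be rerun or audited later.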
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: cleaning-data
Download link: https://github.com/tilmon-engineering/claude-skills/archive/main.zip#cleaning-data
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.