dev-data
CommunityBuild reliable data pipelines.
Authorlidge-jun
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill provides a comprehensive guide to building robust and scalable data engineering pipelines, ensuring data quality and efficient processing.
Core Features & Use Cases
- Data Processing Principles: Learn essential rules for pipeline thinking, schema-first design, defensive parsing, idempotency, and fail-fast error handling.
- Ingestion Patterns: Guidance on handling various formats (CSV, JSON, Parquet, Excel, Database) and implementing incremental loading with schema validation.
- ETL/ELT Design: Understand layered architecture, error handling strategies, and orchestration basics for complex data workflows.
- Data Quality: Implement validation checks and data contracts to maintain data integrity.
- Analysis & Reporting: Best practices for summary statistics, output formats, and statistical reporting.
- Architecture Decisions: Insights into choosing between batch vs. streaming, storage solutions, and relevant tools.
- Use Case: You need to build a daily pipeline to ingest sales data from multiple sources, clean and transform it, and load it into a data warehouse for business intelligence reporting.
Quick Start
Follow the principles outlined in this guide to design a new data pipeline.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: dev-data Download link: https://github.com/lidge-jun/cli-jaw-skills/archive/main.zip#dev-data Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.