data-pipeline-design
OfficialBuild robust data pipelines.
AuthorHarvest-Forged-Code
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill addresses the complexities of designing, building, and maintaining reliable data pipelines, ensuring data integrity, efficient processing, and operational visibility.
Core Features & Use Cases
- Pipeline Pattern Selection: Guides users in choosing between ETL, ELT, and streaming patterns based on specific requirements.
- Stage Design: Provides detailed guidance on designing extract, transform, and load stages, including incremental loading, validation, and upsert strategies.
- Error Handling & Monitoring: Emphasizes robust error handling, dead-letter queues, retry mechanisms, alerting, and comprehensive monitoring for data quality and pipeline operations.
- Idempotency & Checkpointing: Ensures pipelines can be re-run safely and efficiently by implementing idempotency and checkpointing for long-running processes.
- Use Case: Design an ELT pipeline to ingest daily sales data from multiple APIs into a data warehouse, ensuring data is validated, transformed into a star schema, and monitored for any processing anomalies.
Quick Start
Use the data-pipeline-design skill to design an ETL pipeline for ingesting customer data from a PostgreSQL database into an S3 data lake.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: data-pipeline-design Download link: https://github.com/Harvest-Forged-Code/Analyser/archive/main.zip#data-pipeline-design Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.