data-pipeline-manager
CommunityBuild and safeguard robust data pipelines.
Authordangeles
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Data pipelines in research and production environments are prone to validation gaps, unhandled errors, and brittle recoverability, leading to downtime and inconsistent results.
Core Features & Use Cases
- Six-stage workflow design, including design, input validation, transform, output validation, error handling, and monitoring.
- Robust error handling patterns: retries with backoff, checkpointing, structured logging, and recovery from partial failures.
- Monitoring and observability across stages with dashboards and alerting; supports bioinformatics and data processing pipelines.
Quick Start
Define a blueprint describing the six-stage workflow and enable checkpointing. Then configure on your orchestrator to run a dry-run on a small dataset to observe progress and recovery behavior.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: data-pipeline-manager Download link: https://github.com/dangeles/claude/archive/main.zip#data-pipeline-manager Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.