extract-district-data
Community
Extract district data, preserve integrity.
Data & Analytics
#data extraction, #ETL, #PostgreSQL, #data migration, #Parquet, #Neo4j, #referential integrity, #data staging
Author: pauljbernard
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Manually extracting district-specific data from multiple interconnected databases while maintaining referential integrity is complex and error-prone. This Skill automates the entire extraction process so that the extracted data stays accurate and consistent.
Core Features & Use Cases
- Dependency-Aware Extraction: Loads schema and dependency order to extract tables in the correct sequence, preserving all foreign key relationships.
- Intelligent Query Building: Dynamically builds district-filtered queries, handling direct and indirect relationships across relational and graph databases.
- Use Case: Given a district ID, autonomously extract all relevant data from 5 different data stores (PostgreSQL and Neo4j), ensuring all foreign key relationships are preserved and data is consistent. The extracted data is then staged for subsequent anonymization.
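The two features above can be illustrated with a minimal sketch. It is not the Skill's actual code: the table names, the `LINKS` mapping, and the helper functions are hypothetical, and the `:district_id` placeholder assumes a named bind parameter style such as the one accepted by SQLAlchemy's `text()`. Parent tables are ordered before their children with the standard-library `graphlib`, and indirect relationships are handled by nesting subqueries until a table that carries the district ID directly is reached.

```python
from graphlib import TopologicalSorter

# Hypothetical schema: each table maps to the set of tables it references
# through foreign keys (its parents).
FK_DEPENDENCIES = {
    "districts": set(),
    "schools": {"districts"},
    "students": {"schools"},
    "enrollments": {"students"},
}

# Hypothetical link metadata: (column, parent_table).
# parent_table is None when the column holds the district ID directly;
# otherwise the column references parent_table.id.
LINKS = {
    "districts": ("id", None),
    "schools": ("district_id", None),
    "students": ("school_id", "schools"),
    "enrollments": ("student_id", "students"),
}


def extraction_sequence(dependencies):
    """Return table names with every parent listed before its children."""
    return list(TopologicalSorter(dependencies).static_order())


def district_filter(table):
    """Build a WHERE condition limiting `table` to one district."""
    column, parent = LINKS[table]
    if parent is None:
        # Direct relationship: the table stores the district ID itself.
        return f"{column} = :district_id"
    # Indirect relationship: follow the foreign key up to the parent table.
    return f"{column} IN (SELECT id FROM {parent} WHERE {district_filter(parent)})"


def build_query(table):
    """Return a district-filtered SELECT for `table`."""
    return f"SELECT * FROM {table} WHERE {district_filter(table)}"


extraction_order = extraction_sequence(FK_DEPENDENCIES)

for name in extraction_order:
    print(build_query(name))
```

Extracting in this order means a child row is never staged before the parent rows it points to, which is what keeps foreign keys resolvable in the staged copy.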
Quick Start
Extract all data for 'district-001' from PROD databases, ensuring referential integrity and saving to staging.
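One way to verify the referential-integrity part of such a request before staging is to look for orphaned foreign key values in the extracted rows. The sketch below is an illustration only: the `find_orphans` helper, the sample rows, and the foreign key list are hypothetical and do not come from the Skill itself.

```python
def find_orphans(extracted, foreign_keys):
    """Return (child_table, column, value) for every foreign key value
    that has no matching row in the extracted parent table."""
    orphans = []
    for child, column, parent, parent_key in foreign_keys:
        parent_ids = {row[parent_key] for row in extracted.get(parent, [])}
        for row in extracted.get(child, []):
            value = row.get(column)
            # NULL foreign keys are allowed; only dangling values are reported.
            if value is not None and value not in parent_ids:
                orphans.append((child, column, value))
    return orphans


# Hypothetical extracted data, keyed by table name.
extracted = {
    "schools": [{"id": 1, "district_id": "district-001"}],
    "students": [
        {"id": 10, "school_id": 1},
        {"id": 11, "school_id": 2},  # school 2 was not extracted
    ],
}

# Hypothetical foreign keys: (child_table, column, parent_table, parent_key).
foreign_keys = [("students", "school_id", "schools", "id")]

orphans = find_orphans(extracted, foreign_keys)
print(orphans)
```

An empty result means every extracted child row points at a parent row that was also extracted; any entry in the list identifies a relationship that would break in staging.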
Dependency Matrix
Required Modules
- pandas
- pyarrow
- neo4j-driver
- sqlalchemy
- psycopg2-binary
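A quick way to confirm these modules are available is sketched below. Two of the package names differ from their import names (`neo4j-driver` is imported as `neo4j`, `psycopg2-binary` as `psycopg2`); the `IMPORT_NAMES` mapping and the `missing_packages` helper are illustrative and not part of the Skill.

```python
from importlib.util import find_spec

# Package name as installed -> module name as imported.
IMPORT_NAMES = {
    "pandas": "pandas",
    "pyarrow": "pyarrow",
    "neo4j-driver": "neo4j",
    "sqlalchemy": "sqlalchemy",
    "psycopg2-binary": "psycopg2",
}


def missing_packages(import_names):
    """Return the package names whose import module cannot be found."""
    return [pkg for pkg, module in import_names.items() if find_spec(module) is None]


missing = missing_packages(IMPORT_NAMES)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required modules are importable.")
```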
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: extract-district-data
Download link: https://github.com/pauljbernard/datamig/archive/main.zip#extract-district-data
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.