data-lake-architect
Community
Architect scalable data lake patterns.
Author: EmilLindfors
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Provides architectural guidance for designing scalable data lakes and lakehouse patterns, including partitioning, storage layout, and schema design.
Core Features & Use Cases
- Three-tier storage model: Raw, Processed, Curated with clear data lineage.
- Partitioning strategies: Time-based, multi-dimensional, and hashing approaches; Iceberg considerations.
- Schema design and evolution: Wide tables vs normalized designs; strategies for schema evolution.
- Storage layout & lifecycle: Tiered retention and data lifecycle management for cost and accessibility.
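Two of the ideas listed above, hash-based partitioning and tiered retention, can be sketched in a few lines of Python. This is an illustrative sketch only, not the skill's own implementation: the function names, the bucket count, and the 30-day and 365-day thresholds are all assumptions chosen for the example.

```python
import hashlib
from datetime import date

# Assumed retention thresholds (in days); tune these to your own cost model.
HOT_DAYS = 30
WARM_DAYS = 365


def hash_bucket(key: str, num_buckets: int = 16) -> int:
    """Map a key to a stable bucket number in [0, num_buckets).

    MD5 is used only for its stable, well-spread output (not for security);
    Python's built-in hash() is randomized per process for strings, so it
    is unsuitable for deciding where data is stored.
    """
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_buckets


def storage_class(partition_date: date, today: date) -> str:
    """Pick a hypothetical storage class from the age of a partition."""
    age_days = (today - partition_date).days
    if age_days <= HOT_DAYS:
        return "hot"
    if age_days <= WARM_DAYS:
        return "warm"
    return "cold"
```

Hashing spreads high-cardinality keys (such as customer IDs) evenly across a fixed number of buckets, while the age-based classifier shows how a lifecycle rule might move older partitions to cheaper storage.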
Quick Start
Propose a three-tier data lake layout (raw/processed/curated) with date-based partitioning for ingested data.
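As a rough idea of what such a proposal might produce, the sketch below builds date-partitioned paths for each tier. The `partition_path` helper, the `s3://example-lake` root, and the `orders` dataset are hypothetical names used for illustration; the `year=/month=/day=` directory naming follows the common Hive-style convention, which is an assumption rather than something the skill prescribes.

```python
from datetime import date

# The three tiers named in the skill description.
TIERS = ("raw", "processed", "curated")


def partition_path(root: str, tier: str, dataset: str, day: date) -> str:
    """Build a Hive-style, date-partitioned path for one tier of the lake."""
    if tier not in TIERS:
        raise ValueError(f"unknown tier {tier!r}; expected one of {TIERS}")
    return (
        f"{root.rstrip('/')}/{tier}/{dataset}/"
        f"year={day.year:04d}/month={day.month:02d}/day={day.day:02d}"
    )


if __name__ == "__main__":
    # Hypothetical bucket and dataset names.
    for tier in TIERS:
        print(partition_path("s3://example-lake", tier, "orders", date(2024, 3, 7)))
```

Zero-padding the month and day keeps partitions in chronological order when listed lexicographically, which makes range scans and lifecycle rules easier to express.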
Dependency Matrix
Required Modules
None required
Components
Standard package
Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: data-lake-architect
Download link: https://github.com/EmilLindfors/claude-marketplace/archive/main.zip#data-lake-architect
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.