schema-reference

Community

Validate schemas, generate accurate PySpark ETL.

Authorlinus-mcmanamey
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill prevents common schema-related errors and ensures business logic compliance when generating PySpark ETL code. It automates the process of querying actual schemas, extracting business rules from data dictionaries, and comparing schemas between data layers, guaranteeing that generated code is accurate and robust.

Core Features & Use Cases

  • Dynamic Schema Querying: Retrieve exact column names, data types, and constraints from DuckDB warehouse.
  • Business Logic Extraction: Parse data dictionary files to understand relationships, default values, and data quality rules.
  • Cross-Layer Schema Comparison: Identify differences and required transformations between Bronze, Silver, and Gold layer schemas.
  • Use Case: Before writing a new PySpark transformation for a Silver layer table, use this skill to query the Bronze source schema, extract relevant business rules from the data dictionary, and compare it against the target Silver schema to ensure all transformations are correctly defined.

Quick Start

Explain the steps to create a new Silver layer table named 's_customer_case' from 'bronze_cms.b_customer_case', ensuring all schema and business logic are correctly applied.

Dependency Matrix

Required Modules

duckdb

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: schema-reference
Download link: https://github.com/linus-mcmanamey/multi-agent-user-story-development/archive/main.zip#schema-reference

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.