ai-data-engineering

Community

Build AI data infrastructure

Authorancoleman
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill enables the creation and management of robust data infrastructure essential for AI/ML systems, including RAG pipelines, feature stores, and embedding generation.

Core Features & Use Cases

  • RAG Pipelines: Build end-to-end RAG systems from ingestion to evaluation.
  • Feature Stores: Implement ML feature serving with Feast to prevent training-serving skew.
  • Embedding Generation: Create high-quality embeddings using state-of-the-art models.
  • Orchestration: Manage complex data workflows with Dagster or Prefect.
  • Use Case: Develop a RAG pipeline for customer support documentation, enabling semantic search and question answering over your knowledge base.

Quick Start

Use the ai-data-engineering skill to set up a basic RAG pipeline by chunking documents and generating embeddings.

Dependency Matrix

Required Modules

langchainlangchain-corelangchain-openailangchain-voyageailangchain-qdrantqdrant-clientragasdatasetsfeastdagsterdagster-webserverlakefs-client

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: ai-data-engineering
Download link: https://github.com/ancoleman/ai-design-components/archive/main.zip#ai-data-engineering

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.