ai-data-engineering
CommunityBuild AI data infrastructure
Authorancoleman
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill enables the creation and management of robust data infrastructure essential for AI/ML systems, including RAG pipelines, feature stores, and embedding generation.
Core Features & Use Cases
- RAG Pipelines: Build end-to-end RAG systems from ingestion to evaluation.
- Feature Stores: Implement ML feature serving with Feast to prevent training-serving skew.
- Embedding Generation: Create high-quality embeddings using state-of-the-art models.
- Orchestration: Manage complex data workflows with Dagster or Prefect.
- Use Case: Develop a RAG pipeline for customer support documentation, enabling semantic search and question answering over your knowledge base.
Quick Start
Use the ai-data-engineering skill to set up a basic RAG pipeline by chunking documents and generating embeddings.
Dependency Matrix
Required Modules
langchainlangchain-corelangchain-openailangchain-voyageailangchain-qdrantqdrant-clientragasdatasetsfeastdagsterdagster-webserverlakefs-client
Components
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: ai-data-engineering Download link: https://github.com/ancoleman/ai-design-components/archive/main.zip#ai-data-engineering Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.