haipipe-data-4-aidata
CommunityPrepare data for ML models.
Data & Analytics#machine learning#feature engineering#data pipeline#data transformation#dataset splitting#AIDataSet
Authorjluo41
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill transforms raw case data into machine-learning-ready datasets, handling complex splitting and feature transformations required for model training.
Core Features & Use Cases
- Data Splitting: Divides datasets into training, validation, and testing sets using various strategies (time-based, random, stratified).
- Feature Transformation: Converts raw case features into formats suitable for ML models (e.g., token embeddings, numerical sequences).
- Use Case: Prepare a large patient dataset for a diabetes prediction model by splitting it into train/validation/test sets and transforming time-series CGM data into token sequences.
Quick Start
Use the haipipe-data-4-aidata skill to cook a new AIDataSet using the provided configuration file.
Dependency Matrix
Required Modules
None requiredComponents
scriptsreferencesassets
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: haipipe-data-4-aidata Download link: https://github.com/jluo41/research-skills/archive/main.zip#haipipe-data-4-aidata Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.