microimpute
OfficialFill missing survey data with ML imputation.
Data & Analytics#data science#machine learning#quantile forest#missing data#data imputation#survey data#statistical modeling
AuthorPolicyEngine
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Survey datasets often contain missing values, which can hinder accurate policy analysis. MicroImpute provides machine learning-based methods to intelligently fill these gaps while preserving statistical relationships, ensuring more complete and reliable data.
Core Features & Use Cases
- Multiple Imputation Methods: Supports linear, random forest, quantile forest, XGBoost, and hot deck methods to predict missing values.
- Quality Benchmarking: Compares different imputation methods using quantile loss to identify the most accurate approach for preserving data distribution.
- Hyperparameter Tuning: Integrates Optuna for automatic optimization of imputation model parameters.
- Use Case: Impute missing capital gains data in a survey dataset using a quantile forest model, leveraging patterns from a complete IRS tax record dataset, and then benchmark its quality.
Quick Start
Use the microimpute skill to impute missing capital_gains in your recipient DataFrame using donor data and income, age as common variables.
Dependency Matrix
Required Modules
microimputenumpypandasscikit-learnquantile-forestoptunastatsmodelsscipy
Components
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: microimpute Download link: https://github.com/PolicyEngine/policyengine-claude/archive/main.zip#microimpute Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.