microimpute

Official

Fill missing survey data with ML imputation.

AuthorPolicyEngine
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Survey datasets often contain missing values, which can hinder accurate policy analysis. MicroImpute provides machine learning-based methods to intelligently fill these gaps while preserving statistical relationships, ensuring more complete and reliable data.

Core Features & Use Cases

  • Multiple Imputation Methods: Supports linear, random forest, quantile forest, XGBoost, and hot deck methods to predict missing values.
  • Quality Benchmarking: Compares different imputation methods using quantile loss to identify the most accurate approach for preserving data distribution.
  • Hyperparameter Tuning: Integrates Optuna for automatic optimization of imputation model parameters.
  • Use Case: Impute missing capital gains data in a survey dataset using a quantile forest model, leveraging patterns from a complete IRS tax record dataset, and then benchmark its quality.

Quick Start

Use the microimpute skill to impute missing capital_gains in your recipient DataFrame using donor data and income, age as common variables.

Dependency Matrix

Required Modules

microimputenumpypandasscikit-learnquantile-forestoptunastatsmodelsscipy

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: microimpute
Download link: https://github.com/PolicyEngine/policyengine-claude/archive/main.zip#microimpute

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.