ml-data-fetch-annotate

Official

Fetch & annotate ML data for error analysis

AuthorStream-claims
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates the process of fetching machine learning datasets from Redshift and generating annotated PDFs, streamlining the analysis of model errors and data quality.

Core Features & Use Cases

  • Data Fetching: Query and retrieve segment or case data from Redshift for ML evaluation.
  • PDF Annotation: Create annotated PDFs to visualize model predictions against ground truth and facilitate error analysis.
  • Schema Exploration: Understand the data warehouse schema relevant to ML evaluation.
  • Use Case: When a user needs to analyze why a classification model is making incorrect predictions, this skill can fetch the relevant data, generate reviewable PDFs highlighting the errors, and help identify patterns in misclassifications.

Quick Start

Fetch the latest 30 'BILL' segments for 'berkley_ent' and create an annotated PDF named 'segments.pdf'.

Dependency Matrix

Required Modules

pandasboto3PyMuPDF

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: ml-data-fetch-annotate
Download link: https://github.com/Stream-claims/stream-engineering/archive/main.zip#ml-data-fetch-annotate

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.