data-processing-workflows

Official

Coordinate and automate data processing pipelines.

Authorsage-base
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill clarifies and codifies the data processing pipelines required to transform raw meeting minutes, web-scraped data, and politician information into structured analytics. It helps teams align on the sequence, dependencies, and responsibilities across minutes processing, scraping, and matching workflows, reducing integration errors and onboarding time.

Core Features & Use Cases

  • End-to-end pipeline orchestration covering Minutes Divider, Speaker Extraction, Politician Matching, and Affiliations creation.
  • Web Scraping with optional Google Cloud Storage (GCS) uploads and URI management for later processing.
  • Conference Member Extraction with staged processing and LLM-assisted matching to link members to politicians.
  • Clear, scriptable command interface to run each stage and enforce the required processing order.

Quick Start

Activate the data-processing-workflows skill when working on minutes processing or web scraping. Then run typical sequences such as:

  • sagebase process-minutes --pdf-path /path/to/minutes.pdf
  • sagebase extract-speakers-from-minutes
  • sagebase match-speakers-to-politicians
  • sagebase scrape-minutes --council-id 123 --upload-to-gcs

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: data-processing-workflows
Download link: https://github.com/sage-base/sagebase/archive/main.zip#data-processing-workflows

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.