data-processing-workflows
OfficialCoordinate and automate data processing pipelines.
Data & Analytics#llm#workflows#web-scraping#data-processing#speaker-extraction#minutes-processing#politician-matching
Authorsage-base
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill clarifies and codifies the data processing pipelines required to transform raw meeting minutes, web-scraped data, and politician information into structured analytics. It helps teams align on the sequence, dependencies, and responsibilities across minutes processing, scraping, and matching workflows, reducing integration errors and onboarding time.
Core Features & Use Cases
- End-to-end pipeline orchestration covering Minutes Divider, Speaker Extraction, Politician Matching, and Affiliations creation.
- Web Scraping with optional Google Cloud Storage (GCS) uploads and URI management for later processing.
- Conference Member Extraction with staged processing and LLM-assisted matching to link members to politicians.
- Clear, scriptable command interface to run each stage and enforce the required processing order.
Quick Start
Activate the data-processing-workflows skill when working on minutes processing or web scraping. Then run typical sequences such as:
- sagebase process-minutes --pdf-path /path/to/minutes.pdf
- sagebase extract-speakers-from-minutes
- sagebase match-speakers-to-politicians
- sagebase scrape-minutes --council-id 123 --upload-to-gcs
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: data-processing-workflows Download link: https://github.com/sage-base/sagebase/archive/main.zip#data-processing-workflows Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.