cheap-scan1
Community
Token-efficient PDF processing
Education & Research
#pdf #consolidation #pymupdf #token-efficiency #text-extraction #document-triage #vision-routing
Author: naj2r
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
cheap-scan1 reduces API token usage and cost when ingesting and analyzing academic PDFs. It extracts text locally, triages sections for relevance, and invokes vision models only for the pages that need them.
Core Features & Use Cases
- Local zero-token text extraction using pymupdf with per-page quality metrics to detect scanned PDFs and trigger fallbacks.
- Progressive triage funnel (abstract → introduction → conclusion → selective sections) and a --safe mode for exhaustive reads with per-section relevance tags.
- Pattern-driven scanner routing for tables, figures, and equations plus optional page rendering and parallel visual/equation analysis, producing a consolidated notes.md compatible with downstream literature workflows.
- Use cases: large-scale literature reviews, selective law-review processing, batch triage of working papers, and cost-conscious vision calls for empirical research.
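The extraction and routing features above can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: the thresholds `MIN_CHARS` and `MIN_ALPHA_RATIO`, the `ROUTE_PATTERNS` regexes, and the names `PageReport`, `assess_page`, and `scan_pdf` are all assumptions introduced for the example. Only `pymupdf.open` and `page.get_text()` are real pymupdf calls.

```python
import re
from dataclasses import dataclass

# Hypothetical thresholds; the skill's real values are not documented here.
MIN_CHARS = 200          # fewer extracted characters suggests a scanned page
MIN_ALPHA_RATIO = 0.6    # share of letters and whitespace among all characters

# Hypothetical patterns for pages that may deserve a vision call.
ROUTE_PATTERNS = {
    "table": re.compile(r"\bTable\s+\d+", re.IGNORECASE),
    "figure": re.compile(r"\b(?:Figure|Fig\.)\s*\d+", re.IGNORECASE),
    "equation": re.compile(r"\(\d+\)\s*$", re.MULTILINE),
}


@dataclass
class PageReport:
    number: int
    chars: int
    alpha_ratio: float
    needs_fallback: bool
    routes: list


def assess_page(number: int, text: str) -> PageReport:
    """Compute simple quality metrics and routing hints for one page of text."""
    stripped = text.strip()
    chars = len(stripped)
    readable = sum(c.isalpha() or c.isspace() for c in stripped)
    alpha_ratio = readable / chars if chars else 0.0
    # Too little text, or mostly symbols, is treated as a failed local extraction.
    needs_fallback = chars < MIN_CHARS or alpha_ratio < MIN_ALPHA_RATIO
    routes = [name for name, pat in ROUTE_PATTERNS.items() if pat.search(stripped)]
    return PageReport(number, chars, alpha_ratio, needs_fallback, routes)


def scan_pdf(path: str) -> list:
    """Extract text locally with pymupdf and assess every page."""
    # Imported lazily so assess_page can be used without the dependency.
    # Older pymupdf releases expose the same module as `fitz`.
    import pymupdf

    reports = []
    with pymupdf.open(path) as doc:
        for index, page in enumerate(doc):
            reports.append(assess_page(index + 1, page.get_text()))
    return reports
```

Under these assumptions, a page flagged with `needs_fallback` would be a candidate for rendering and a vision call, while pages with a non-empty `routes` list could be sent to a table, figure, or equation scanner. All other pages stay on the zero-token text path.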
Quick Start
Run the cheap-scan1 pipeline on path/to/paper.pdf in default mode to produce a consolidated notes.md in the specified output directory.
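As a rough picture of what a default-mode run might do between extraction and notes.md, the sketch below walks the triage funnel over sections that have already been split out and renders the tags as a notes file. The keyword-overlap score, the `threshold` value, the `safe` parameter (standing in for `--safe`), the notes layout, and the function names `relevance`, `triage`, and `render_notes` are all hypothetical. The skill's real relevance logic and output format are not specified here.

```python
# Funnel stages read first, in order; remaining sections follow selectively.
FUNNEL = ["abstract", "introduction", "conclusion"]


def relevance(text: str, keywords: list) -> float:
    """Fraction of keywords that occur in the text (case-insensitive)."""
    lowered = text.lower()
    hits = sum(1 for k in keywords if k.lower() in lowered)
    return hits / len(keywords) if keywords else 0.0


def triage(sections: dict, keywords: list, threshold: float = 0.5,
           safe: bool = False) -> list:
    """Tag sections in funnel order; stop early on an irrelevant funnel stage.

    With safe=True every section is read and tagged, with no early exit.
    """
    ordered = [name for name in FUNNEL if name in sections]
    ordered += [name for name in sections if name not in FUNNEL]
    tagged = []
    for name in ordered:
        score = relevance(sections[name], keywords)
        tag = "relevant" if score >= threshold else "skip"
        tagged.append((name, tag))
        if not safe and name in FUNNEL and tag == "skip":
            break  # the paper failed a funnel stage; stop spending effort on it
    return tagged


def render_notes(title: str, tagged: list) -> str:
    """Render the tags as a small markdown document (hypothetical layout)."""
    lines = [f"# {title}", ""]
    lines += [f"- {name}: {tag}" for name, tag in tagged]
    return "\n".join(lines) + "\n"
```

The early exit is the point of the funnel in this sketch: a paper whose abstract or introduction misses the topic is dropped before its body is read, while the exhaustive mode trades that saving for complete per-section tags.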
Dependency Matrix
Required Modules
pymupdf
Components
Standard package
Claude Code Installation
Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: cheap-scan1
Download link: https://github.com/naj2r/claude-econ-paper-template/archive/main.zip#cheap-scan1
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.