pdf-to-md
Community
Turn mixed PDFs into AI-friendly Markdown.
Author: taturou
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Converts mixed PDFs containing text, images, and charts into AI-friendly Markdown for easier understanding, indexing, and processing.
Core Features & Use Cases
- PDF type detection, text extraction, and OCR that together produce consistent Markdown.
- Batch processing to handle multiple PDFs in one run, with chunking for large documents.
- Structured output that preserves meaning (headings, tables, figures) and supports downstream AI workflows.
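The chunking mentioned above could look something like the sketch below. This is not the skill's actual code: the function name `chunk_page_ranges` and the idea of chunking by fixed page count are assumptions made for illustration. It only computes page ranges, so it runs without any PDF library installed.

```python
from typing import List, Tuple


def chunk_page_ranges(total_pages: int, chunk_size: int) -> List[Tuple[int, int]]:
    """Split pages 1..total_pages into consecutive, inclusive (first, last) ranges.

    Hypothetical helper: the real skill may chunk by size, tokens, or sections.
    """
    if total_pages < 0:
        raise ValueError("total_pages must be non-negative")
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")

    ranges: List[Tuple[int, int]] = []
    first = 1
    while first <= total_pages:
        # The last chunk may be shorter than chunk_size.
        last = min(first + chunk_size - 1, total_pages)
        ranges.append((first, last))
        first = last + 1
    return ranges


if __name__ == "__main__":
    print(chunk_page_ranges(10, 4))  # [(1, 4), (5, 8), (9, 10)]
```

The ranges are 1-indexed and inclusive so that each pair could be passed as `first_page` and `last_page` to pdf2image's `convert_from_path`, which would keep only one chunk of page images in memory at a time.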
Quick Start
Run the pipeline on a batch of PDFs. It writes one Markdown file per document and places the images in an images/ directory.
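As a rough illustration of that output layout, the sketch below plans where each file would go. The names `plan_outputs` and `image_path` and the image file naming scheme are hypothetical; the listing only states that there is one Markdown file per document and an images/ directory.

```python
from pathlib import Path
from typing import Dict, List


def plan_outputs(pdf_paths: List[Path], out_dir: Path) -> Dict[Path, Path]:
    """Map each input PDF to its Markdown output path (one .md per document)."""
    # Sorting gives a stable processing order across runs.
    return {pdf: out_dir / f"{pdf.stem}.md" for pdf in sorted(pdf_paths)}


def image_path(out_dir: Path, stem: str, page: int, index: int) -> Path:
    """Build a path inside the shared images/ directory.

    Assumed naming scheme: <document stem>-p<page>-<index>.png
    """
    return out_dir / "images" / f"{stem}-p{page:03d}-{index}.png"


if __name__ == "__main__":
    plan = plan_outputs([Path("in/report.pdf"), Path("in/notes.pdf")], Path("out"))
    for pdf, markdown in plan.items():
        print(pdf, "->", markdown)
    print(image_path(Path("out"), "report", 3, 1))
```

Prefixing image names with the document stem keeps files from different PDFs apart in the shared images/ directory. Two inputs with the same stem in different folders would still collide, so a real pipeline would need to handle that case.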
Dependency Matrix
Required Modules
- pdf2image
- pillow
Components
- scripts
- references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: pdf-to-md
Download link: https://github.com/taturou/pdf-to-md-skill/archive/main.zip#pdf-to-md
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.