pdf-to-markdown

Community

Turn PDFs into full-context Markdown.

Authoraliceisjustplaying
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill converts entire PDFs into clean, structured Markdown that can be loaded into context for analysis, preserving formatting and embedded content.

Core Features & Use Cases

  • Full-document extraction: Convert all text, headers, tables, lists, and images from a PDF into Markdown for context loading.
  • Table and image handling: Uses IBM Docling's TableFormer AI model for accurate tables and extracts images to a cache for reference.
  • Use Case: Ideal when you need the entire document content available for in-context analysis or long-form review.

Quick Start

Run: python pdf_to_md.py document.pdf to generate document.md with images placed alongside.

Dependency Matrix

Required Modules

pymupdfdoclingdocling-core

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: pdf-to-markdown
Download link: https://github.com/aliceisjustplaying/claude-resources-monorepo/archive/main.zip#pdf-to-markdown

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.