pdf-to-markdown-converter

Official

PDFs to Markdown: Text & Images, instantly.

Authortalent-factory
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Manually extracting text and images from PDFs is a tedious, time-consuming task that locks valuable information within static documents. This skill liberates your content, transforming complex PDFs into easily editable and searchable Markdown, saving you hours of manual effort and enabling seamless content reuse.

Core Features & Use Cases

  • PDF to Markdown Conversion: Converts entire PDF documents into structured Markdown, preserving text layout.
  • Image Extraction: Automatically extracts all embedded images and saves them as PNG files, referenced within the Markdown.
  • Smart Text Correction: Automatically fixes common LaTeX umlaut encoding issues (e.g., ¨a to ä) and converts ß to ss (Swiss German style), ensuring accurate text representation.
  • Use Case: Quickly convert research papers, reports, or scanned documents into an editable format for analysis, content repurposing, or integration into knowledge bases, without losing images or struggling with formatting.

Quick Start

Convert the attached PDF 'document.pdf' to Markdown, including all images, and save the output in a new folder named 'converted_docs'.

Dependency Matrix

Required Modules

PyMuPDFPillow

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: pdf-to-markdown-converter
Download link: https://github.com/talent-factory/dotfiles/archive/main.zip#pdf-to-markdown-converter

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository