mineru-ocr

Community

Convert documents to Markdown with OCR.

Authorcat-xierluo
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates the conversion of various document types (PDF, Word, PPT, images) into Markdown format, including extracting text from scanned documents using OCR, recognizing tables, and identifying mathematical formulas.

Core Features & Use Cases

  • Multi-format Conversion: Supports PDF, DOC, DOCX, PPT, PPTX, PNG, JPG, JPEG.
  • OCR Capabilities: Extracts text from images and scanned documents.
  • Table and Formula Recognition: Preserves tabular data and identifies mathematical expressions.
  • Use Case: You have a scanned PDF of a research paper containing complex tables and equations. Use this Skill to convert it into a well-structured Markdown document, making the content easily searchable and editable.

Quick Start

Use the mineru-ocr skill to convert the file '/Users/user/Documents/report.pdf' to Markdown.

Dependency Matrix

Required Modules

None required

Components

scriptsconfigarchive

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: mineru-ocr
Download link: https://github.com/cat-xierluo/legal-skills/archive/main.zip#mineru-ocr

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.