mineru-ocr
CommunityConvert documents to Markdown with OCR.
Content & Communication#ocr#document conversion#text extraction#pdf to markdown#scanned documents#table recognition
Authorcat-xierluo
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill automates the conversion of various document types (PDF, Word, PPT, images) into Markdown format, including extracting text from scanned documents using OCR, recognizing tables, and identifying mathematical formulas.
Core Features & Use Cases
- Multi-format Conversion: Supports PDF, DOC, DOCX, PPT, PPTX, PNG, JPG, JPEG.
- OCR Capabilities: Extracts text from images and scanned documents.
- Table and Formula Recognition: Preserves tabular data and identifies mathematical expressions.
- Use Case: You have a scanned PDF of a research paper containing complex tables and equations. Use this Skill to convert it into a well-structured Markdown document, making the content easily searchable and editable.
Quick Start
Use the mineru-ocr skill to convert the file '/Users/user/Documents/report.pdf' to Markdown.
Dependency Matrix
Required Modules
None requiredComponents
scriptsconfigarchive
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: mineru-ocr Download link: https://github.com/cat-xierluo/legal-skills/archive/main.zip#mineru-ocr Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.