ocr-and-documents
Community
Extract text from any document.
Author: Aum08Desai
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill automates the extraction of text and data from various document formats, including PDFs, scanned documents, and images, eliminating manual data entry and content retrieval bottlenecks.
Core Features & Use Cases
- Text Extraction: Retrieves text from text-based PDFs and scanned documents using OCR.
- Document Parsing: Handles complex layouts, tables, equations, and code blocks from various file types.
- Remote URL Processing: Extracts content directly from URLs, simplifying access to online documents.
- Use Case: Automatically extract all text and tables from a research paper PDF, a scanned business report, or a presentation slide to quickly gather information for analysis.
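The features above can be illustrated with a minimal sketch that routes a file path or URL to an extractor based on its extension. This is not the Skill's actual implementation: `classify_source`, `extract_pdf_text`, and `extract_docx_text` are hypothetical helper names, and the sketch assumes a recent PyMuPDF release that can be imported as `pymupdf` (older releases use `import fitz`). Scanned pages without a text layer would still need an OCR step, which is not shown here.

```python
from pathlib import Path
from urllib.parse import urlparse


def classify_source(source: str) -> str:
    """Return a coarse document kind for a local path or URL (hypothetical helper)."""
    # For http(s) URLs, inspect only the path component so query strings are ignored.
    parsed = urlparse(source)
    path = parsed.path if parsed.scheme in ("http", "https") else source
    suffix = Path(path).suffix.lower()
    if suffix == ".pdf":
        return "pdf"
    if suffix == ".docx":
        return "docx"
    if suffix == ".pptx":
        return "pptx"
    if suffix in {".png", ".jpg", ".jpeg", ".tif", ".tiff"}:
        return "image"
    return "unknown"


def extract_pdf_text(pdf_path: str) -> str:
    """Read the embedded text layer of a PDF with PyMuPDF.

    Pages that are pure scans return empty strings here and would need OCR.
    """
    import pymupdf  # imported lazily so classify_source works without it

    with pymupdf.open(pdf_path) as doc:
        return "\n".join(page.get_text() for page in doc)


def extract_docx_text(docx_path: str) -> str:
    """Join the paragraph text of a .docx file using python-docx."""
    import docx  # import name of the python-docx package

    document = docx.Document(docx_path)
    return "\n".join(paragraph.text for paragraph in document.paragraphs)


if __name__ == "__main__":
    for example in ("report.pdf", "https://example.com/paper.pdf?download=1", "scan.png"):
        print(example, "->", classify_source(example))
```

Keeping the library imports inside the extractor functions means the routing logic can run even when only some of the optional packages are installed.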
Quick Start
Use the ocr-and-documents skill to extract all text from the file named 'report.pdf'.
Dependency Matrix
Required Modules
- pymupdf
- pymupdf4llm
- marker-pdf
- python-docx
- python-pptx
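As a rough way to check whether the required modules are available before using the Skill, the following sketch looks up each package's import name with the standard library. The mapping from package name to import name is an assumption based on how these packages are commonly imported (for example, `python-docx` as `docx` and `marker-pdf` as `marker`), and `missing_packages` is a hypothetical helper, not part of the Skill.

```python
from importlib.util import find_spec

# Assumed mapping: distribution name on PyPI -> top-level import name.
REQUIRED = {
    "pymupdf": "pymupdf",
    "pymupdf4llm": "pymupdf4llm",
    "marker-pdf": "marker",
    "python-docx": "docx",
    "python-pptx": "pptx",
}


def missing_packages(required=REQUIRED):
    """Return the distribution names whose import module cannot be found."""
    return [pkg for pkg, module in required.items() if find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```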
Components
scripts
💻 Claude Code Installation
Recommended: let Claude install it automatically. Copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: ocr-and-documents
Download link: https://github.com/Aum08Desai/hermes-research-agent/archive/main.zip#ocr-and-documents
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.