ocr-and-documents

Community

Extract text from any document.

AuthorAum08Desai
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates the extraction of text and data from various document formats, including PDFs, scanned documents, and images, eliminating manual data entry and content retrieval bottlenecks.

Core Features & Use Cases

  • Text Extraction: Retrieves text from text-based PDFs and scanned documents using OCR.
  • Document Parsing: Handles complex layouts, tables, equations, and code blocks from various file types.
  • Remote URL Processing: Extracts content directly from URLs, simplifying access to online documents.
  • Use Case: Automatically extract all text and tables from a research paper PDF, a scanned business report, or a presentation slide to quickly gather information for analysis.

Quick Start

Use the ocr-and-documents skill to extract all text from the file named 'report.pdf'.

Dependency Matrix

Required Modules

pymupdfpymupdf4llmmarker-pdfpython-docxpython-pptx

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: ocr-and-documents
Download link: https://github.com/Aum08Desai/hermes-research-agent/archive/main.zip#ocr-and-documents

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.