ocr-document-processor

Community

Unlock text from any document.

Authordkyazzentwatwa
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill overcomes the challenge of extracting text from images and scanned documents, making them searchable, editable, and processable.

Core Features & Use Cases

  • Image & PDF OCR: Extracts text from various image formats (PNG, JPEG) and scanned PDFs.
  • Multi-language Support: Handles over 100 languages for global document processing.
  • Structured Output: Provides text in plain text, Markdown, JSON, or HTML, and can extract tables to CSV.
  • Use Case: Automatically convert a stack of scanned receipts into a structured JSON file for expense reporting.

Quick Start

Use the ocr-document-processor skill to extract all text from the file 'receipt.png'.

Dependency Matrix

Required Modules

pytesseractPillowPyMuPDFopencv-pythonnumpy

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: ocr-document-processor
Download link: https://github.com/dkyazzentwatwa/chatgpt-skills/archive/main.zip#ocr-document-processor

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.