ocr-document-processor
CommunityUnlock text from any document.
Authordkyazzentwatwa
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill overcomes the challenge of extracting text from images and scanned documents, making them searchable, editable, and processable.
Core Features & Use Cases
- Image & PDF OCR: Extracts text from various image formats (PNG, JPEG) and scanned PDFs.
- Multi-language Support: Handles over 100 languages for global document processing.
- Structured Output: Provides text in plain text, Markdown, JSON, or HTML, and can extract tables to CSV.
- Use Case: Automatically convert a stack of scanned receipts into a structured JSON file for expense reporting.
Quick Start
Use the ocr-document-processor skill to extract all text from the file 'receipt.png'.
Dependency Matrix
Required Modules
pytesseractPillowPyMuPDFopencv-pythonnumpy
Components
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: ocr-document-processor Download link: https://github.com/dkyazzentwatwa/chatgpt-skills/archive/main.zip#ocr-document-processor Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.