ocr-backend-management
Official
Orchestrate plug-and-play OCR backends.
Author: kreuzberg-dev
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This skill solves the challenge of coordinating multiple OCR engines (Tesseract, PaddleOCR, EasyOCR) within a single, pluggable backend framework, enabling dynamic backend selection, health checks, and consistent processing pipelines.
Core Features & Use Cases
- Pluggable OCR backend registry with health checks and per-backend health status reporting.
- Support for multiple engines (Tesseract, PaddleOCR, EasyOCR) with language validation and optional auto-download.
- Preprocessing, language detection integration, and caching to boost performance and reliability.
- Table detection and HOCR support to extract structured data from complex layouts.
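Two of the features above, language validation and caching, can be sketched in a few lines. This is a minimal illustration, not the skill's actual implementation: the language table, `validate_language`, and `OcrCache` are hypothetical names, and the OCR call is a stand-in function rather than a real engine.

```python
import hashlib
from typing import Callable, Dict, FrozenSet, List, Tuple

# Hypothetical language table (assumption): real engines publish their own lists.
SUPPORTED_LANGUAGES: Dict[str, FrozenSet[str]] = {
    "tesseract": frozenset({"eng", "deu", "fra"}),
    "easyocr": frozenset({"en", "de", "fr"}),
}


def validate_language(backend: str, language: str) -> str:
    """Return the language code if the backend supports it, else raise."""
    supported = SUPPORTED_LANGUAGES.get(backend)
    if supported is None:
        raise KeyError(f"unknown backend: {backend}")
    if language not in supported:
        raise ValueError(f"{backend} does not support language {language!r}")
    return language


class OcrCache:
    """In-memory cache keyed by image content hash, backend, and language."""

    def __init__(self) -> None:
        self._store: Dict[Tuple[str, str, str], str] = {}

    def get_or_compute(
        self,
        image_bytes: bytes,
        backend: str,
        language: str,
        run: Callable[[bytes], str],
    ) -> str:
        key = (hashlib.sha256(image_bytes).hexdigest(), backend, language)
        if key not in self._store:
            # Only invoke the (expensive) OCR call on a cache miss.
            self._store[key] = run(image_bytes)
        return self._store[key]


# Usage with a stand-in OCR function that records how often it is called.
calls: List[bytes] = []


def fake_ocr(image_bytes: bytes) -> str:
    calls.append(image_bytes)
    return "recognized text"


cache = OcrCache()
lang = validate_language("tesseract", "eng")
first = cache.get_or_compute(b"image-data", "tesseract", lang, fake_ocr)
second = cache.get_or_compute(b"image-data", "tesseract", lang, fake_ocr)
```

Keying the cache on a content hash plus backend and language means the same image processed with a different engine or language is never served a stale result.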
Quick Start
Register a backend, configure the OCR processor, and run a sample image to verify correct engine selection and output.
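The flow described here could look roughly like the sketch below. It does not use the skill's real API, which this page does not show: `Backend`, `BackendRegistry`, and `select` are hypothetical names, and the engines are replaced by stand-in functions so the example runs without any OCR library installed.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Backend:
    """Hypothetical backend record: a name, an OCR callable, and a health probe."""
    name: str
    process: Callable[[bytes], str]
    health_check: Callable[[], bool]


class BackendRegistry:
    """Hypothetical registry that selects the first healthy backend."""

    def __init__(self) -> None:
        self._backends: Dict[str, Backend] = {}

    def register(self, backend: Backend) -> None:
        if backend.name in self._backends:
            raise ValueError(f"backend already registered: {backend.name}")
        self._backends[backend.name] = backend

    def is_healthy(self, name: str) -> bool:
        try:
            return bool(self._backends[name].health_check())
        except Exception:
            # A probe that raises is treated as an unhealthy backend.
            return False

    def health_status(self) -> Dict[str, bool]:
        return {name: self.is_healthy(name) for name in self._backends}

    def select(self, preferred: List[str]) -> Backend:
        """Return the first registered, healthy backend in preference order."""
        for name in preferred:
            if name in self._backends and self.is_healthy(name):
                return self._backends[name]
        raise RuntimeError("no healthy OCR backend available")


# Register two stand-in backends; the first one reports itself as unhealthy.
registry = BackendRegistry()
registry.register(Backend("tesseract", lambda img: "tesseract-output", lambda: False))
registry.register(Backend("easyocr", lambda img: "easyocr-output", lambda: True))

# Configure a preference order, then run a sample "image" through the chosen backend.
chosen = registry.select(["tesseract", "easyocr"])
result = chosen.process(b"sample image bytes")
```

Because the preferred backend fails its health check, selection falls through to the next entry, which is the behavior to verify when testing engine selection on a sample image.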
Dependency Matrix
Required Modules: None required
Components: Standard package
Claude Code Installation
Recommended: let Claude install the skill automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: ocr-backend-management
Download link: https://github.com/kreuzberg-dev/kreuzberg/archive/main.zip#ocr-backend-management
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.