doc-parser
Official
Unlock document content with advanced parsing.
Content & Communication
#ocr, #pdf extraction, #document parsing, #data structuring, #docling, #layout analysis
Author: claude-office-skills
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill extracts and structures information from a range of document formats, including complex PDFs, Word documents, and images, while preserving their original layout and content.
Core Features & Use Cases
- Advanced Document Parsing: Utilizes the docling library for state-of-the-art document understanding.
- Structure Preservation: Maintains the original layout, tables, figures, and multi-column text flow.
- Multi-format Support: Handles PDFs (native and scanned), Word documents, images, and HTML.
- Use Case: Convert a research paper into structured Markdown, extract all tables from a financial report, or parse an academic paper to identify its title, abstract, sections, and references.
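For readers who want to see what the first use case might look like when calling docling directly, here is a minimal sketch. It assumes docling is installed and uses its `DocumentConverter` class with `export_to_markdown()`; the helper names `markdown_path_for` and `convert_to_markdown` are hypothetical and are not part of this Skill, which may drive docling differently.

```python
from pathlib import Path


def markdown_path_for(source: str) -> Path:
    """Return the output path: same folder and stem as the source, .md suffix."""
    return Path(source).with_suffix(".md")


def convert_to_markdown(source: str) -> Path:
    """Convert one document to Markdown and write it next to the source file."""
    # Imported lazily so the path helper above works without docling installed.
    from docling.document_converter import DocumentConverter

    # Assumption: default converter settings are good enough for this document.
    result = DocumentConverter().convert(source)
    target = markdown_path_for(source)
    target.write_text(result.document.export_to_markdown(), encoding="utf-8")
    return target
```

Calling `convert_to_markdown("research_paper.pdf")` would write `research_paper.md` alongside the PDF. Scanned inputs and table-only extraction may need extra configuration that this sketch does not cover.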
Quick Start
Use the doc-parser skill to convert the attached document 'research_paper.pdf' into structured markdown.
Dependency Matrix
Required Modules
docling
Components
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: doc-parser
Download link: https://github.com/claude-office-skills/skills/archive/main.zip#doc-parser
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
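If you prefer to do the download-and-extract step by hand, the following is a rough sketch of what it involves. It assumes the archive unpacks to a single top-level folder with a `doc-parser/` directory directly inside it; that layout is a guess based on the link above and has not been verified. The function names `extract_skill` and `install_skill` are hypothetical.

```python
import io
import zipfile
from pathlib import Path, PurePosixPath
from urllib.request import urlopen


def extract_skill(archive_bytes: bytes, skill: str, skills_dir: Path) -> list[Path]:
    """Copy only the files under <top-level>/<skill>/ into skills_dir/<skill>/."""
    written = []
    with zipfile.ZipFile(io.BytesIO(archive_bytes)) as zf:
        for info in zf.infolist():
            parts = PurePosixPath(info.filename).parts
            # Assumed layout: <top-level>/<skill>/<files...>; skip everything else.
            if info.is_dir() or len(parts) < 3 or parts[1] != skill:
                continue
            relative = parts[2:]
            if ".." in relative:
                continue  # never write outside the target folder
            target = skills_dir.joinpath(skill, *relative)
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(zf.read(info))
            written.append(target)
    return written


def install_skill(url: str, skill: str, skills_dir: Path) -> list[Path]:
    """Download the archive at `url` and extract the named skill from it."""
    with urlopen(url) as response:
        return extract_skill(response.read(), skill, skills_dir)
```

A call such as `install_skill("https://github.com/claude-office-skills/skills/archive/main.zip", "doc-parser", Path(".claude/skills"))` would then place the files under `.claude/skills/doc-parser/`, provided the assumed layout holds.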