docx-smart-extractor
CommunityExtract Word docs with token-efficient, structured results.
Authordiegocconsolini
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Local, zero-LLM extraction of Word documents (1MB-50MB+) that preserves structure, formatting, and tables. Enables fast querying and downstream processing without sending data to LLMs.
Core Features & Use Cases
- Preserve headings, tables, metadata, and formatting.
- Semantic chunking for efficient querying and content navigation.
- Caching for instant reuse and offline processing.
Quick Start
Use a Word document path (docx) and run extraction; then query via the cache to retrieve structured chunks.
Dependency Matrix
Required Modules
python-docx
Components
scriptsreferencesassets
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: docx-smart-extractor Download link: https://github.com/diegocconsolini/ClaudeSkillCollection/archive/main.zip#docx-smart-extractor Please download this .zip file, extract it, and install it in the .claude/skills/ directory.