docx-smart-extractor

Community

Extract Word docs with token-efficient, structured results.

Authordiegocconsolini
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Local, zero-LLM extraction of Word documents (1MB-50MB+) that preserves structure, formatting, and tables. Enables fast querying and downstream processing without sending data to LLMs.

Core Features & Use Cases

  • Preserve headings, tables, metadata, and formatting.
  • Semantic chunking for efficient querying and content navigation.
  • Caching for instant reuse and offline processing.

Quick Start

Use a Word document path (docx) and run extraction; then query via the cache to retrieve structured chunks.

Dependency Matrix

Required Modules

python-docx

Components

scriptsreferencesassets

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: docx-smart-extractor
Download link: https://github.com/diegocconsolini/ClaudeSkillCollection/archive/main.zip#docx-smart-extractor

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository