audit-extract
Community
Extract footnotes from DOCX with formatting.
Author: edwinhu
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill addresses the challenge of extracting and structuring footnote data from DOCX documents, including preserving formatting and resolving cross-references, which is crucial for legal and academic citation management.
Core Features & Use Cases
- Footnote Extraction: Parses DOCX files to extract all footnotes.
- Formatting Preservation: Retains inline formatting (italic, bold, small caps) within extracted footnotes.
- URL Archiving: Extracts all URLs found within footnotes for an inventory.
- Citation Registry: Builds a mapping of hereinafter definitions and author-to-first-cite references.
- Cross-reference Resolution: Identifies and resolves supra note [_] placeholders.
- Use Case: Legal professionals can use this to automatically process lengthy legal briefs, ensuring all citations and references are correctly extracted and formatted for audit.
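To make the extraction and formatting-preservation features concrete, here is a minimal sketch of how footnotes can be read from a DOCX file. It is not the Skill's actual code: the function name `extract_footnotes`, the helper `_is_on`, and the `<i>`, `<b>`, `<sc>` output markers are assumptions made for illustration. It also uses the standard library's `xml.etree.ElementTree` so that it runs without extra installs, whereas the Skill itself lists lxml as a dependency.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by word/footnotes.xml inside a DOCX archive.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}


def _is_on(rpr, tag):
    """True if a toggle property such as w:i is present and not switched off."""
    if rpr is None:
        return False
    el = rpr.find(f"w:{tag}", NS)
    if el is None:
        return False
    return el.get(f"{{{W}}}val", "true") not in ("false", "0", "off")


def extract_footnotes(docx_path):
    """Return {footnote_id: text}, marking italic, bold, and small caps runs.

    The <i>, <b>, <sc> markers are an illustrative choice, not the Skill's format.
    """
    with zipfile.ZipFile(docx_path) as zf:
        root = ET.fromstring(zf.read("word/footnotes.xml"))

    notes = {}
    for fn in root.findall("w:footnote", NS):
        # Word stores separator pseudo-footnotes next to the real ones; skip them.
        if fn.get(f"{{{W}}}type") in ("separator", "continuationSeparator"):
            continue
        parts = []
        for run in fn.iter(f"{{{W}}}r"):
            text = "".join(t.text or "" for t in run.findall("w:t", NS))
            if not text:
                continue
            rpr = run.find("w:rPr", NS)
            if _is_on(rpr, "smallCaps"):
                text = f"<sc>{text}</sc>"
            if _is_on(rpr, "b"):
                text = f"<b>{text}</b>"
            if _is_on(rpr, "i"):
                text = f"<i>{text}</i>"
            parts.append(text)
        notes[int(fn.get(f"{{{W}}}id"))] = "".join(parts).strip()
    return notes
```

The sketch reads formatting only from each run's direct properties. Formatting inherited from character or paragraph styles would need a lookup in `word/styles.xml`, which is omitted here.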
Quick Start
Run the extract_footnotes.py script with the path to your DOCX file to begin the extraction process.
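Once footnote text is available, the URL inventory and the detection of unresolved supra note placeholders could look roughly like the sketch below. Everything in it is an assumption for illustration: the input is a hypothetical dictionary mapping footnote numbers to plain text, the function names `url_inventory` and `unresolved_supra` are invented, and the placeholder pattern assumes blanks written as `[_]` or a run of underscores. The output format of extract_footnotes.py is not documented here and may differ.

```python
import re

# Simple URL matcher; stops at whitespace, quotes, and closing brackets.
URL_RE = re.compile(r"https?://[^\s<>\"')\]]+")
# Assumed placeholder shape: "supra note" followed by a blank such as [_] or __.
SUPRA_RE = re.compile(r"supra\s+note\s+(\[_*\]|_+)", re.IGNORECASE)


def url_inventory(notes):
    """Map each URL to the sorted footnote numbers in which it appears."""
    inventory = {}
    for num, text in notes.items():
        for match in URL_RE.findall(text):
            # Trailing sentence punctuation is usually not part of the URL.
            url = match.rstrip(".,;:")
            inventory.setdefault(url, []).append(num)
    return {url: sorted(set(nums)) for url, nums in inventory.items()}


def unresolved_supra(notes):
    """Return footnote numbers that still contain a blank supra note placeholder."""
    return sorted(num for num, text in notes.items() if SUPRA_RE.search(text))


if __name__ == "__main__":
    sample = {
        1: "Jane Doe, Example Title, https://example.com/a.",
        2: "Doe, supra note [_], at 5.",
    }
    print(url_inventory(sample))
    print(unresolved_supra(sample))
```

Filling a placeholder with the right footnote number would additionally need the citation registry (author to first cite), which this sketch does not build.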
Dependency Matrix
Required Modules
lxml
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: audit-extract
Download link: https://github.com/edwinhu/workflows/archive/main.zip#audit-extract
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.