audit-extract

Community

Extract footnotes from DOCX with formatting.

Author: edwinhu
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill extracts and structures footnote data from DOCX documents. It preserves inline formatting and resolves cross-references, both of which matter for legal and academic citation management.

Core Features & Use Cases

  • Footnote Extraction: Parses DOCX files to extract all footnotes.
  • Formatting Preservation: Retains inline formatting (italic, bold, small caps) within extracted footnotes.
  • URL Archiving: Collects every URL found within footnotes into an inventory.
  • Citation Registry: Builds a mapping of hereinafter definitions and author-to-first-cite references.
  • Cross-reference Resolution: Identifies and resolves supra note [_] placeholders.
  • Use Case: Legal professionals can use this to automatically process lengthy legal briefs, ensuring all citations and references are correctly extracted and formatted for audit.
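
The Citation Registry and Cross-reference Resolution features can be illustrated with a small sketch. This is not the Skill's actual code: the function names `build_registry` and `resolve_supra`, the `authors` parameter, and the sample footnotes are all hypothetical. It assumes footnotes are already available as a mapping from footnote number to plain text, and that a placeholder appears literally as `supra note [_]` right after the short form or author name.

```python
import re

# "[hereinafter Smith, Contracts]" defines the short form "Smith, Contracts".
HEREINAFTER = re.compile(r"\[hereinafter\s+([^\]]+)\]")
PLACEHOLDER = "supra note [_]"


def build_registry(footnotes, authors=()):
    """Map each short form (or author name) to the first footnote citing it."""
    registry = {}
    for number in sorted(footnotes):
        text = footnotes[number]
        for short in HEREINAFTER.findall(text):
            registry.setdefault(short.strip(), number)
        # Assumption: author names are supplied by the caller, not detected.
        for author in authors:
            if author in text:
                registry.setdefault(author, number)
    return registry


def resolve_supra(footnotes, registry):
    """Fill in 'supra note [_]' placeholders that follow a known short form."""
    resolved = {}
    for number, text in footnotes.items():
        for short, target in registry.items():
            if target < number:  # supra may only point to an earlier note
                text = text.replace(
                    f"{short}, {PLACEHOLDER}", f"{short}, supra note {target}"
                )
        resolved[number] = text
    return resolved


# Hypothetical sample data for illustration only.
footnotes = {
    1: "John Smith, A Theory of Contracts 12 (2001) [hereinafter Smith, Contracts].",
    2: "Jane Doe, Property Rules, 10 Law Rev. 1 (1999).",
    3: "Smith, Contracts, supra note [_], at 40; Doe, supra note [_], at 5.",
    4: "Unknown, supra note [_].",
}
registry = build_registry(footnotes, authors=["Doe"])
resolved = resolve_supra(footnotes, registry)
unresolved = [n for n, text in resolved.items() if PLACEHOLDER in text]
```

Footnotes that still contain the placeholder after resolution (collected in `unresolved` here) are the ones an auditor would need to check by hand.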

Quick Start

Run the extract_footnotes.py script with the path to your DOCX file to begin the extraction process.
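
The internals of extract_footnotes.py are not shown here. As a rough sketch of the first step such a script has to perform, the example below reads `word/footnotes.xml` out of a DOCX archive and marks italic, bold, and small-caps runs. It uses the standard library's `zipfile` and `xml.etree.ElementTree` rather than lxml to stay self-contained. The function names, the inline markers (`*`, `**`, `^`), and the URL pattern are assumptions made for illustration, and only direct run formatting is handled (formatting inherited from styles is ignored).

```python
import re
import zipfile
import xml.etree.ElementTree as ET

W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"
URL = re.compile(r"https?://[^\s<>\"]+")
# Run properties mapped to hypothetical inline markers for the output.
MARKERS = {"i": "*", "b": "**", "smallCaps": "^"}


def is_on(rpr, name):
    """True if a toggle property such as <w:i/> is present and not switched off."""
    el = rpr.find(W + name) if rpr is not None else None
    return el is not None and el.get(W + "val", "true") not in ("0", "false")


def run_text(run):
    """Text of one run, wrapped in markers for its direct formatting."""
    text = "".join(t.text or "" for t in run.iter(W + "t"))
    if not text:
        return ""
    rpr = run.find(W + "rPr")
    for name, mark in MARKERS.items():
        if is_on(rpr, name):
            text = f"{mark}{text}{mark}"
    return text


def extract_footnotes(docx):
    """Return {footnote id: marked-up text} for a DOCX path or file object."""
    with zipfile.ZipFile(docx) as archive:
        # Raises KeyError if the document has no footnotes part.
        root = ET.fromstring(archive.read("word/footnotes.xml"))
    notes = {}
    for note in root.iter(W + "footnote"):
        # Skip separator and continuation entries, which carry no citation text.
        if note.get(W + "type", "normal") != "normal":
            continue
        text = "".join(run_text(r) for r in note.iter(W + "r")).strip()
        notes[int(note.get(W + "id"))] = text
    return notes


def find_urls(text):
    """URLs in a footnote, with trailing punctuation and markers trimmed."""
    return [u.rstrip(".,;:)*^") for u in URL.findall(text)]
```

Calling `extract_footnotes` with a path to a DOCX file returns the footnotes keyed by their numeric id, and `find_urls` can then be applied to each entry to build the URL inventory.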

Dependency Matrix

Required Modules

lxml

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: audit-extract
Download link: https://github.com/edwinhu/workflows/archive/main.zip#audit-extract

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.