Name: liff-catalog-pdf-extraction
Author: EdwardSalkeld

System Documentation

What problem does it solve?

This Skill automates the extraction of structured film data from PDF catalog pages, which often mix film entries with non-film content such as introductions, indexes, and adverts.

Core Features & Use Cases

Multi-pass Extraction: Handles complex PDFs by performing text extraction, page classification, film block segmentation, and JSON normalization.
Selective Extraction: Skips non-film pages (like introductions, indexes, adverts) and logs the reasons.
Structured Output: Generates one JSON file per film with detailed metadata, or logs skipped pages.
Use Case: Process a batch of LIFF catalog PDFs to automatically create a structured database of all films featured in the catalog, including details like title, director, runtime, and description.
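
The passes listed above (page classification, film block segmentation, JSON normalization, and skip logging) can be sketched roughly as follows. This is a minimal illustration, not the Skill's actual implementation: it assumes the text has already been extracted from each PDF page, and it assumes a hypothetical catalog layout in which every film block starts with a title line and contains `Director: ...` and `Runtime: NN min` lines. The names `classify_page`, `segment_blocks`, `normalize`, `process_pages`, and the JSON field names are all illustrative.

```python
import json
import re
from dataclasses import asdict, dataclass
from pathlib import Path
from typing import Optional

# Assumed layout (hypothetical): a film block has a title line plus
# "Director: ..." and "Runtime: NN min" lines. Real catalogs will differ.
DIRECTOR_RE = re.compile(r"^[ \t]*Director:[ \t]*(.+)$", re.MULTILINE)
RUNTIME_RE = re.compile(r"^[ \t]*Runtime:[ \t]*(\d+)[ \t]*min",
                        re.MULTILINE | re.IGNORECASE)


@dataclass
class Film:
    title: str
    director: str
    runtime_minutes: Optional[int]
    description: str
    source_page: int


def classify_page(text: str) -> Optional[str]:
    """Return a skip reason for a non-film page, or None for a film page."""
    if not text.strip():
        return "empty page"
    if not DIRECTOR_RE.search(text):
        return "no film entries found"
    return None


def segment_blocks(text: str) -> list:
    """Split a page on blank lines and keep blocks that look like film entries."""
    blocks = re.split(r"\n\s*\n", text.strip())
    return [b for b in blocks if DIRECTOR_RE.search(b)]


def normalize(block: str, page: int) -> Film:
    """Turn one film block into a Film record."""
    lines = [ln.strip() for ln in block.splitlines() if ln.strip()]
    runtime = RUNTIME_RE.search(block)
    # Everything after the title that is not a metadata line is description.
    body = [ln for ln in lines[1:]
            if not DIRECTOR_RE.match(ln) and not RUNTIME_RE.match(ln)]
    return Film(
        title=lines[0],
        director=DIRECTOR_RE.search(block).group(1).strip(),
        runtime_minutes=int(runtime.group(1)) if runtime else None,
        description=" ".join(body),
        source_page=page,
    )


def slugify(title: str) -> str:
    """Build a filesystem-friendly file stem from a film title."""
    return re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-") or "untitled"


def process_pages(pages: dict, out_dir: Path) -> list:
    """Write one JSON file per film and return the log of skipped pages."""
    out_dir.mkdir(parents=True, exist_ok=True)
    skipped = []
    for page, text in sorted(pages.items()):
        reason = classify_page(text)
        if reason:
            skipped.append({"page": page, "reason": reason})
            continue
        for block in segment_blocks(text):
            film = normalize(block, page)
            path = out_dir / f"{slugify(film.title)}.json"
            path.write_text(
                json.dumps(asdict(film), indent=2, ensure_ascii=False),
                encoding="utf-8",
            )
    return skipped
```

Calling `process_pages({10: page_text}, Path("films"))` with the extracted text of a page would write one file per detected film into `films/` and return a list of `{"page": ..., "reason": ...}` entries for skipped pages. Keeping classification separate from segmentation means a page can be rejected cheaply before any per-film parsing is attempted.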

Quick Start

Use the liff-catalog-pdf-extraction skill to extract film data from the file 'page-10.pdf'.

Please help me install this Skill:
Name: liff-catalog-pdf-extraction
Download link: https://github.com/EdwardSalkeld/liff-archive/archive/main.zip#liff-catalog-pdf-extraction
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
