long-document-llm-pipeline
CommunityEfficient LLM processing for long documents.
Software Engineering#llm#batching#long-document#cost-reduction#document-pipeline#section-splitting#breadcrumb-context
Authorshimo4228
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Long documents (>50K characters) used in LLM workflows suffer from attention drift, excessive input costs, inability to retry partial sections, and lack of parallel processing. This Skill provides a robust 6-step pipeline to structure, chunk, and batch-process content while preserving section context.
Core Features & Use Cases
- Section-based extraction and processing: convert large inputs into manageable chunks with breadcrumb context for accurate LLM results.
- Cost-aware batching and routing: parallelize requests and route models by section size to minimize costs.
- Applied to PDFs, transcripts, and multilingual reports with inherent structure, enabling flashcards, summaries, or Q&A datasets.
Quick Start
Process a document larger than 50K characters through the pipeline to generate per-section prompts and batched LLM results.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: long-document-llm-pipeline Download link: https://github.com/shimo4228/claude-code-learned-skills/archive/main.zip#long-document-llm-pipeline Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.