unstructured-pdf-generation

Community

Generate synthetic PDFs for RAG

AuthorLaurentPRAT-DB
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill automates the creation of realistic synthetic PDF documents, which is crucial for testing and developing Retrieval-Augmented Generation (RAG) systems and handling unstructured data.

Core Features & Use Cases

  • Synthetic PDF Generation: Creates professional PDF documents with LLM-generated content based on detailed descriptions.
  • RAG Data Preparation: Generates accompanying JSON files with questions and evaluation guidelines, ideal for RAG testing.
  • Automated Upload: Uploads generated PDFs and JSONs to Unity Catalog Volumes for easy access.
  • Use Case: Generate 20 technical documentation PDFs for a new SaaS platform to populate a vector database for a RAG system.

Quick Start

Use the unstructured-pdf-generation skill to generate 10 technical documentation PDFs for a cloud infrastructure platform, saving them to the 'my_catalog' catalog and 'my_schema' schema.

Dependency Matrix

Required Modules

None required

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: unstructured-pdf-generation
Download link: https://github.com/LaurentPRAT-DB/LPT_claude_config/archive/main.zip#unstructured-pdf-generation

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.