to-markdown
Community
Convert files and URLs into clean Markdown
Author: Mathews-Tom
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Converts diverse file types and web content into clean, LLM-friendly Markdown to make documents searchable, ingestible, and readable without manual cleanup or format-specific tooling.
Core Features & Use Cases
- Wide format support: Handles PDF, DOCX, PPTX, XLSX, HTML, images (EXIF + OCR), audio (transcription), CSV/JSON/XML, YouTube, EPUB, and more.
- Robust fetch strategies: Uses trafilatura for static pages and Playwright for JS-rendered content, with YouTube transcript handling and escalation paths for paywalls and scanned PDFs.
- Workflow-ready output: Produces Markdown files or inline content optimized for RAG, knowledge bases, and LLM pipelines with rules for tables, headings, and post-processing.
- Error handling & escalation: Detects empty extractions, suggests OCR or Azure Document Intelligence, and warns on protected or paywalled content.
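The routing and empty-extraction checks described above can be sketched roughly as follows. This is a minimal illustration and not the skill's actual implementation: the function names (`pick_strategy`, `needs_escalation`), the routing table, the strategy labels, the `js_rendered` flag, and the 50-character threshold are all assumptions made for the example. It uses only the standard library and returns a label for the tool that would be used, without calling trafilatura or Playwright.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Hypothetical routing table: file extension -> converter label.
# The labels are illustrative, not names used by the skill itself.
EXTENSION_ROUTES = {
    ".pdf": "pdf",
    ".docx": "office",
    ".pptx": "office",
    ".xlsx": "office",
    ".html": "html",
    ".csv": "structured",
    ".json": "structured",
    ".xml": "structured",
    ".epub": "epub",
}

# Hosts treated as YouTube for transcript handling (assumed list).
YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def pick_strategy(source: str, js_rendered: bool = False) -> str:
    """Return a label for the conversion strategy a source would be routed to."""
    parsed = urlparse(source)
    if parsed.scheme in ("http", "https"):
        host = (parsed.hostname or "").lower()
        if host in YOUTUBE_HOSTS:
            return "youtube-transcript"
        # Static pages go to trafilatura; JS-rendered pages go to Playwright.
        return "playwright" if js_rendered else "trafilatura"
    # Anything that is not an http(s) URL is treated as a file path.
    suffix = PurePosixPath(source).suffix.lower()
    return EXTENSION_ROUTES.get(suffix, "unknown")


def needs_escalation(markdown: str, min_chars: int = 50) -> bool:
    """Flag an extraction as effectively empty (assumed threshold: 50 characters)."""
    return len(markdown.strip()) < min_chars
```

In a sketch like this, a `True` result from `needs_escalation` on a PDF would be the point at which OCR or Azure Document Intelligence is suggested, since a near-empty result often indicates a scanned document.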
Quick Start
Convert the file /tmp/report.pdf to Markdown and save the output as /tmp/report.md.
Dependency Matrix
Required Modules
None required
Components
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: to-markdown Download link: https://github.com/Mathews-Tom/praxis-skills/archive/main.zip#to-markdown Please download this .zip file, extract it, and install it in the .claude/skills/ directory.