to-markdown


Convert files and URLs into clean Markdown

Author: Mathews-Tom
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Converts diverse file types and web content into clean, LLM-friendly Markdown to make documents searchable, ingestible, and readable without manual cleanup or format-specific tooling.

Core Features & Use Cases

  • Wide format support: Handles PDF, DOCX, PPTX, XLSX, HTML, images (EXIF + OCR), audio (transcription), CSV/JSON/XML, YouTube, EPUB, and more.
  • Robust fetch strategies: Uses trafilatura for static pages and Playwright for JS-rendered content, with YouTube transcript handling and escalation paths for paywalls and scanned PDFs.
  • Workflow-ready output: Produces Markdown files or inline content optimized for RAG, knowledge bases, and LLM pipelines with rules for tables, headings, and post-processing.
  • Error handling & escalation: Detects empty extractions, suggests OCR or Azure Document Intelligence, and warns on protected or paywalled content.
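The routing and escalation behavior described above can be pictured with a minimal sketch. It is not the skill's actual implementation: the function names `choose_strategy` and `needs_escalation`, the strategy labels, the `js_rendered` flag, and the 50-character threshold are all hypothetical, and the sketch only returns labels rather than calling trafilatura or Playwright.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Hypothetical labels for illustration; the skill's real routing table is not documented here.
FILE_STRATEGIES = {
    ".pdf": "pdf",
    ".docx": "office",
    ".pptx": "office",
    ".xlsx": "office",
    ".html": "html",
    ".csv": "structured-data",
    ".json": "structured-data",
    ".xml": "structured-data",
    ".epub": "epub",
}

YOUTUBE_HOSTS = {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def choose_strategy(source: str, js_rendered: bool = False) -> str:
    """Return a label for the conversion path a source would take."""
    parsed = urlparse(source)
    if parsed.scheme in ("http", "https"):
        host = (parsed.hostname or "").lower()
        if host in YOUTUBE_HOSTS:
            return "youtube-transcript"
        # Static pages go to trafilatura; JS-rendered pages go to Playwright.
        return "playwright" if js_rendered else "trafilatura"
    # Anything that is not an http(s) URL is treated as a file path.
    suffix = PurePosixPath(source).suffix.lower()
    return FILE_STRATEGIES.get(suffix, "unknown")


def needs_escalation(markdown: str, min_chars: int = 50) -> bool:
    """Flag an empty or near-empty extraction (assumed threshold)."""
    return len(markdown.strip()) < min_chars
```

Under these assumptions, `choose_strategy("/tmp/report.pdf")` yields the `pdf` label, and a result that fails `needs_escalation` would be the point at which OCR or Azure Document Intelligence is suggested.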

Quick Start

Convert the file /tmp/report.pdf to Markdown and save the output as /tmp/report.md.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Copy and paste the text below into Claude Code.

Please help me install this Skill:
Name: to-markdown
Download link: https://github.com/Mathews-Tom/praxis-skills/archive/main.zip#to-markdown

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
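For readers who prefer to see what the extraction step involves, here is a minimal sketch of doing it by hand. It assumes the archive contains a folder named after the skill somewhere in its tree; the actual layout of the repository is not confirmed here, and `extract_skill` is a hypothetical helper, not part of any tool.

```python
import io
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(zip_bytes: bytes, skill_name: str, skills_dir: Path) -> list[Path]:
    """Copy files found under a folder named `skill_name` into skills_dir/skill_name."""
    written: list[Path] = []
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            parts = PurePosixPath(info.filename).parts
            if info.is_dir() or ".." in parts or skill_name not in parts:
                continue
            start = parts.index(skill_name)
            if start == len(parts) - 1:
                # A file that merely shares the skill's name, not a folder.
                continue
            # Keep the path from the skill folder downward.
            target = skills_dir.joinpath(*parts[start:])
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(archive.read(info))
            written.append(target)
    return written
```

The archive bytes could come from fetching the download link above with `urllib.request.urlopen(...).read()`, with `skills_dir` set to `Path(".claude/skills")`.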