Edge Case Handling in html-to-markdown

Official

Robust HTML to Markdown conversion.

Authorkreuzberg-dev
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill ensures reliable conversion of HTML to Markdown by meticulously handling a wide array of edge cases, including binary data, encoding issues, and malformed HTML.

Core Features & Use Cases

  • Binary Data Detection: Prevents processing of non-HTML files (e.g., ZIP, PDF, GZIP) through multi-layered checks.
  • UTF-16 Handling: Detects and flags UTF-16 encoded HTML, guiding users to decode to UTF-8.
  • Malformed HTML Recovery: Employs robust parsing strategies to gracefully handle unclosed tags, mismatched nesting, and invalid characters.
  • Use Case: Convert a webpage that contains mixed encodings or slightly broken HTML structure into clean Markdown without errors.

Quick Start

Convert the provided HTML content, ensuring that any malformed elements are handled gracefully.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: Edge Case Handling in html-to-markdown
Download link: https://github.com/kreuzberg-dev/html-to-markdown/archive/main.zip#edge-case-handling-in-html-to-markdown

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.