Character Removal for Watermark

Community

Disrupt LLM watermarks with character edits.

Author: zzw4257
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

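This Skill helps evaluate how robust hidden watermarks in text generated by Large Language Models (LLMs) are. It applies character-level edits to watermarked text so that a detector can be re-run on the modified text to see whether the watermark survives.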

Core Features & Use Cases

  • Watermark Disruption: Applies character-level perturbations to text to break or weaken LLM watermarks.
  • Watermark Detection Aid: Can be used in conjunction with detection tools to test the robustness of watermarking techniques.
  • Use Case: A researcher wants to test if a new LLM's watermark can be bypassed. They use this Skill to modify the LLM's output and then attempt to detect the watermark on the modified text.
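
The evaluation step in the use case above can be sketched as follows. This is a minimal illustration, not the Skill's own code: `edit_ratio`, `robustness_report`, and the `detect` callback are hypothetical names, and the detector itself is assumed to be supplied by whatever watermark detection tool the researcher already uses. The sketch only measures how much the text was changed and records the detector's score before and after.

```python
from difflib import SequenceMatcher
from typing import Callable, Dict


def edit_ratio(original: str, modified: str) -> float:
    """Return the share of the text that changed, from 0.0 (identical) to 1.0."""
    matcher = SequenceMatcher(None, original, modified, autojunk=False)
    return 1.0 - matcher.ratio()


def robustness_report(
    original: str,
    modified: str,
    detect: Callable[[str], float],
) -> Dict[str, float]:
    """Compare a detector's score on the original and on the modified text.

    `detect` is a hypothetical callback: any function that maps text to a
    numeric watermark score. It is not part of this Skill.
    """
    return {
        "edit_ratio": edit_ratio(original, modified),
        "score_before": detect(original),
        "score_after": detect(modified),
    }


if __name__ == "__main__":
    # Placeholder detector for demonstration only; it just returns the length.
    def placeholder_detect(text: str) -> float:
        return float(len(text))

    print(robustness_report("hello world", "helo wrld", placeholder_detect))
```

Reporting the edit ratio next to the two scores makes it easier to judge whether a drop in the detector's score came from a small change or from rewriting most of the text.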

Quick Start

Run the character removal tool on the provided text file 'output.txt'.

Dependency Matrix

Required Modules

None required

Components

  • scripts
  • references

💻 Claude Code Installation

Recommended: Let Claude install the Skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: Character Removal for Watermark
Download link: https://github.com/zzw4257/security-skills/archive/main.zip#character-removal-for-watermark

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.