JailFuzzer

Community

Fuzz LLMs for content safety.

Author: zzw4257
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill addresses the challenge of ensuring content safety in LLM-based text-to-image models by systematically testing for and identifying jailbreaking vulnerabilities.

Core Features & Use Cases

  • LLM-based Fuzzing: Utilizes LLM agents to generate adversarial prompts designed to bypass safety filters (see the sketch after this list).
  • Content Safety Testing: Specifically targets text-to-image models to uncover prompt injection vulnerabilities.
  • Use Case: A developer can use this Skill to proactively test their new text-to-image model for potential misuse before public release, ensuring it adheres to safety guidelines.
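
The sketch below illustrates the general shape of an LLM-driven fuzzing loop rather than the Skill's actual implementation: every name in it (mutate_prompt, is_blocked, SEED_PROMPTS) is a hypothetical placeholder for the LLM agent, the target model's safety filter, and the seed corpus.

```python
# A minimal sketch of an LLM-driven fuzzing loop; all names here are
# hypothetical placeholders, not JailFuzzer's real API.
import random

SEED_PROMPTS = ["a scenic mountain lake at dawn"]  # benign seed corpus

def mutate_prompt(prompt: str) -> str:
    """Stand-in for the LLM agent that rewrites a prompt adversarially."""
    return f"{prompt}, described indirectly to avoid filter keywords"

def is_blocked(prompt: str) -> bool:
    """Stand-in for the target model's safety filter."""
    return "avoid filter" not in prompt  # toy logic only

def fuzz(rounds: int = 100) -> list[str]:
    """Mutate seeds and keep every prompt the filter fails to block."""
    bypasses: list[str] = []
    for _ in range(rounds):
        seed = random.choice(SEED_PROMPTS + bypasses)
        candidate = mutate_prompt(seed)
        if not is_blocked(candidate):
            # An unblocked adversarial prompt is a candidate jailbreak;
            # feed it back in as a new seed for further mutation.
            bypasses.append(candidate)
    return bypasses

if __name__ == "__main__":
    print(f"{len(fuzz())} candidate bypasses found")
```

In the Skill itself, the mutation step would be performed by an LLM agent and the filter check would query the target text-to-image model, but the feedback loop follows this pattern.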

Quick Start

Use the JailFuzzer skill to scan the attached file 'test_prompts.txt' for vulnerabilities.

Dependency Matrix

Required Modules

None required

Components

  • scripts
  • references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below into Claude Code.

Please help me install this Skill:
Name: JailFuzzer
Download link: https://github.com/zzw4257/security-skills/archive/main.zip#jailfuzzer

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
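
For a manual install without Claude's help, the steps above translate roughly to the sketch below; the archive layout inside the zip and the final skill path are assumptions, so adjust to your setup.

```python
# A rough manual-install sketch of the steps described above; the archive
# layout and the skills directory location are assumptions.
import io
import urllib.request
import zipfile
from pathlib import Path

ARCHIVE_URL = "https://github.com/zzw4257/security-skills/archive/main.zip"
SKILLS_DIR = Path(".claude") / "skills"  # project-level skills directory

def install():
    # Download the repository archive into memory.
    data = urllib.request.urlopen(ARCHIVE_URL).read()
    SKILLS_DIR.mkdir(parents=True, exist_ok=True)
    # Extract the archive; the JailFuzzer skill itself lives in a
    # subdirectory of the extracted repo and may need to be moved into
    # .claude/skills/ by hand afterwards.
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        archive.extractall(SKILLS_DIR)
    print(f"Extracted {ARCHIVE_URL} into {SKILLS_DIR}/")

if __name__ == "__main__":
    install()
```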
