Name: safety-filter-bypass
Availability: InStock
Author: pluginagentmarketplace

System Documentation

What problem does it solve?

This Skill enables authorized testers to assess and bypass AI safety filters and guardrails to identify weaknesses in protective mechanisms.

Core Features & Use Cases

Testing: Evaluate pre-filter, post-filter, and guidance robustness across LLM deployments.
Technique catalog: Includes encoding, semantic, structural, and multimodal bypass strategies with ready-to-run examples.
Ethical testing: Provides guidelines for responsible disclosure and consent-based testing in controlled environments.

Quick Start

Run authorized tests with the included harness to execute encoding and semantic bypass variants against your configured filters.

Please help me install this Skill: Name: safety-filter-bypass Download link: https://github.com/pluginagentmarketplace/custom-plugin-ai-red-teaming/archive/main.zip#safety-filter-bypass Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

safety-filter-bypass

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper