BAIT

Community

Scan LLMs for backdoors.

Authorzzw4257
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill addresses the emerging threat of backdoors embedded within Large Language Models (LLMs), enabling proactive detection and mitigation.

Core Features & Use Cases

  • LLM Backdoor Detection: Identifies hidden malicious functionalities within LLMs.
  • Inverted Attack Target: Utilizes a novel approach to uncover vulnerabilities by inverting the attack vector.
  • Use Case: Security researchers and AI developers can use this Skill to audit LLMs before deployment, ensuring they are free from adversarial manipulations.

Quick Start

Use the BAIT skill to scan the attached LLM model for potential backdoors.

Dependency Matrix

Required Modules

None required

Components

scriptsreferencesassets

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: BAIT
Download link: https://github.com/zzw4257/security-skills/archive/main.zip#bait

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.