System Documentation

What problem does it solve?

This Skill helps identify and flag toxic comments and harmful content within text, contributing to safer online environments and compliance with content moderation policies.

Core Features & Use Cases

  • Toxicity Classification: Detects various forms of toxicity including general toxicity, severe toxicity, obscenity, threats, insults, and identity-based attacks.
  • Compliance Assessment: Supports evaluating AI systems against regulatory requirements such as Article 9 of the EU AI Act, which covers risk management for high-risk AI systems.
  • Use Case: A social media platform can use this skill to automatically flag potentially harmful user comments for review before they are published, reducing the spread of hate speech.
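
The flag-for-review use case can be sketched as a simple thresholding step over per-category scores. This is a minimal illustration, not the Skill's actual implementation: the score format (a dictionary of values between 0 and 1), the category keys, the 0.5 threshold, and the function names flag_for_review and should_hold are all assumptions made for this example.

```python
from typing import Dict, List

# Category keys mirror the feature list above; the exact key names are an assumption.
CATEGORIES = (
    "toxicity",
    "severe_toxicity",
    "obscene",
    "threat",
    "insult",
    "identity_attack",
)


def flag_for_review(scores: Dict[str, float], threshold: float = 0.5) -> List[str]:
    """Return the categories whose score meets or exceeds the threshold."""
    return [name for name in CATEGORIES if scores.get(name, 0.0) >= threshold]


def should_hold(scores: Dict[str, float], threshold: float = 0.5) -> bool:
    """Hold a comment for human review if any category is flagged."""
    return bool(flag_for_review(scores, threshold))


# Hypothetical scores for one comment (illustrative values, not real model output).
example_scores = {
    "toxicity": 0.91,
    "severe_toxicity": 0.12,
    "obscene": 0.08,
    "threat": 0.03,
    "insult": 0.77,
    "identity_attack": 0.02,
}

flagged = flag_for_review(example_scores)
print(flagged)  # ['toxicity', 'insult']
```

A single shared threshold keeps the sketch short. A real moderation pipeline would likely tune thresholds per category, for example a lower bar for threats than for obscenity.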

Quick Start

Use the detoxify skill to analyze the provided text for toxicity.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install the Skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: detoxify
Download link: https://github.com/DTMC-marketplace/governance/archive/main.zip#detoxify

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
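
For readers who prefer to see what the extract-and-install step involves, the following is a rough sketch of copying one Skill folder out of a downloaded archive into a skills directory. It rests on several assumptions: that the Skill lives in a folder named after it inside the archive, that the archive's top-level folder is governance-main, and that the helper name install_skill and the placeholder file SKILL.md are purely illustrative. The demo runs against a throwaway archive built in a temporary directory rather than the real download.

```python
import tempfile
import zipfile
from pathlib import Path


def install_skill(zip_path: Path, skill_name: str, skills_dir: Path) -> Path:
    """Copy the files under the archive folder named skill_name into skills_dir/skill_name."""
    target = skills_dir / skill_name
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            parts = Path(info.filename).parts
            if skill_name not in parts:
                continue
            rest = parts[parts.index(skill_name) + 1:]
            # Skip a file that merely shares the skill's name, and refuse path traversal.
            if not rest or ".." in rest:
                continue
            destination = target.joinpath(*rest)
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(archive.read(info))
    return target


# Demo against a throwaway archive; the file names inside it are placeholders.
with tempfile.TemporaryDirectory() as tmp:
    archive_path = Path(tmp) / "main.zip"
    with zipfile.ZipFile(archive_path, "w") as demo:
        demo.writestr("governance-main/detoxify/SKILL.md", "placeholder")
        demo.writestr("governance-main/other-skill/SKILL.md", "ignored")
    installed = install_skill(archive_path, "detoxify", Path(tmp) / ".claude" / "skills")
    installed_files = sorted(p.name for p in installed.iterdir())
    print(installed_files)  # ['SKILL.md']
```

Only files under the named folder are copied, so other Skills bundled in the same archive are left out.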
