detoxify
OfficialDetect toxic comments.
Legal & Compliance#natural language processing#content moderation#harmful content#toxicity detection#AI Act compliance#societal risk
AuthorDTMC-marketplace
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill helps identify and flag toxic comments and harmful content within text, contributing to safer online environments and compliance with content moderation policies.
Core Features & Use Cases
- Toxicity Classification: Detects various forms of toxicity including general toxicity, severe toxicity, obscenity, threats, insults, and identity-based attacks.
- Compliance Assessment: Assists in evaluating AI systems against regulatory requirements like Article 9 of the EU AI Act concerning societal risks.
- Use Case: A social media platform can use this skill to automatically flag potentially harmful user comments for review before they are published, reducing the spread of hate speech.
Quick Start
Use the detoxify skill to analyze the provided text for toxicity.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: detoxify Download link: https://github.com/DTMC-marketplace/governance/archive/main.zip#detoxify Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.