huggingface-evaluate

Official

Evaluate ML models & datasets

AuthorDTMC-marketplace
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill streamlines the evaluation of machine learning models and datasets, particularly those hosted on Hugging Face, ensuring performance and compliance.

Core Features & Use Cases

  • Model & Dataset Evaluation: Access over 100 metrics for accuracy, fairness, bias, and environmental impact.
  • Compliance Assessment: Evaluate AI systems against EU AI Act Article 15 requirements.
  • Risk Mitigation: Implement controls for performance-related risks.
  • Use Case: Evaluate a new natural language processing model for bias before deploying it in a customer-facing application.

Quick Start

Use the huggingface-evaluate skill to assess the bias of the model 'bert-base-uncased' on the dataset 'imdb'.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: huggingface-evaluate
Download link: https://github.com/DTMC-marketplace/governance/archive/main.zip#huggingface-evaluate

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.