Name: models-test
Availability: InStock
Author: sunchendd

System Documentation

What problem does it solve?

大模型测试自动化框架，用于在华为昇腾NPU上进行VLLM和MindIE推理性能及精度评估。支持自动化NPU资源管理、Docker容器化部署和并行测试。当用户提出以下请求时请使用此技能：测试大模型性能或精度、运行VLLM或MindIE基准测试、生成模型测试命令、或请求执行测试工作流。 Agent需要逐步完整执行测试工作流程，确保资源管理和错误处理。

Core Features & Use Cases

自动化测试：VLLM与MindIE性能测试、EvalScope对比评估、资源分配与容器管理。
并行与可扩展性：多模型并行测试、动态NPU分配、结果集中汇总和报告。
Use Case: 在CI/CD流水线中对所有上新模型执行端到端基准测试，确保部署前的稳定性与性能。

Quick Start

Run an automated benchmark suite for your models and retrieve a structured results report.

Please help me install this Skill: Name: models-test Download link: https://github.com/sunchendd/good_skills/archive/main.zip#models-test Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

models-test

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper