benchmark-runner

Community

Consistent, reproducible benchmarks for decisions.

AuthorMathews-Tom
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Standardizes the evaluation and comparison of algorithms, models, or implementations by producing structured benchmark reports and reproducible results.

Core Features & Use Cases

  • Metric selection and design: latency, throughput, memory, accuracy, with test-case design and environment capture.
  • Reproducible reports: generation of comparison tables and trade-off analysis to support decision making.
  • Use Case: When you need to choose between two model implementations, run standardized benchmarks and compare their performance and resource usage across representative workloads.

Quick Start

Run the benchmark runner on two or more implementations with a defined workload and capture hardware/software context to compare results.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: benchmark-runner
Download link: https://github.com/Mathews-Tom/praxis-skills/archive/main.zip#benchmark-runner

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.