benchmark-runner
Community
Consistent, reproducible benchmarks for decisions.
Author: Mathews-Tom
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Standardizes the evaluation and comparison of algorithms, models, or implementations by producing structured benchmark reports and reproducible results.
Core Features & Use Cases
- Metric selection and design: choose among latency, throughput, memory, and accuracy, design the test cases, and capture the environment the tests run in.
- Reproducible reports: generation of comparison tables and trade-off analysis to support decision making.
- Use Case: When you need to choose between two model implementations, run standardized benchmarks and compare their performance and resource usage across representative workloads.
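The metrics listed above can be collected in a few lines of code. The listing does not document the skill's own interface, so the following is only a minimal Python sketch under stated assumptions: `measure`, its parameters, and the keys of the returned dictionary are hypothetical names chosen for illustration. Accuracy is left out because it depends on having task-specific reference outputs.

```python
import statistics
import time
import tracemalloc


def measure(fn, workload, repeats=5):
    """Hypothetical helper: time fn on each item of a non-empty workload.

    Returns median per-call latency, overall throughput, and peak memory.
    """
    latencies = []
    for _ in range(repeats):
        for item in workload:
            start = time.perf_counter()
            fn(item)
            latencies.append(time.perf_counter() - start)

    # Peak memory is measured in a separate pass so that tracing
    # overhead does not distort the timing numbers above.
    tracemalloc.start()
    for item in workload:
        fn(item)
    _, peak_bytes = tracemalloc.get_traced_memory()
    tracemalloc.stop()

    total_s = sum(latencies)
    return {
        "median_latency_s": statistics.median(latencies),
        "throughput_calls_per_s": len(latencies) / total_s if total_s > 0 else float("inf"),
        "peak_memory_bytes": peak_bytes,
    }
```

Reporting the median rather than the mean keeps a single slow call (for example, a cold cache on the first iteration) from dominating the latency figure.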
Quick Start
Run the benchmark runner on two or more implementations with a defined workload and capture hardware/software context to compare results.
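As an illustration of that workflow, the sketch below times two or more implementations on the same workload, records the hardware/software context, and renders a small comparison table. It is not the skill's actual implementation: `capture_environment`, `compare`, and `render_report` are hypothetical names, and only Python standard-library calls are used.

```python
import platform
import sys
import time


def capture_environment():
    """Record the software/hardware context the results depend on."""
    return {
        "python": sys.version.split()[0],
        "implementation": platform.python_implementation(),
        "os": platform.platform(),
        "machine": platform.machine(),
    }


def time_once(fn, workload):
    """Time one full pass of fn over the workload, in seconds."""
    start = time.perf_counter()
    for item in workload:
        fn(item)
    return time.perf_counter() - start


def compare(implementations, workload, repeats=5):
    """Return one row per implementation, sorted fastest first."""
    rows = []
    for name, fn in implementations.items():
        # Best-of-N reduces noise from other processes on the machine.
        best = min(time_once(fn, workload) for _ in range(repeats))
        rows.append({"name": name, "best_s": best})
    return sorted(rows, key=lambda row: row["best_s"])


def render_report(rows, env):
    """Format the environment and results as plain text."""
    lines = ["Environment:"]
    lines += [f"  {key}: {value}" for key, value in env.items()]
    lines.append(f"{'implementation':<20}{'best (s)':>12}")
    lines += [f"{row['name']:<20}{row['best_s']:>12.6f}" for row in rows]
    return "\n".join(lines)


if __name__ == "__main__":
    # Two toy implementations of the same task: summing a list.
    candidates = {
        "builtin_sum": lambda xs: sum(xs),
        "manual_loop": lambda xs: [acc := 0] and [acc := acc + x for x in xs][-1],
    }
    workload = [list(range(1000)) for _ in range(50)]
    print(render_report(compare(candidates, workload), capture_environment()))
```

Storing the environment alongside the timings is what makes a report reproducible: a later run on a different machine or interpreter can be recognized as such instead of being mistaken for a regression.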
Dependency Matrix
Required Modules
None required
Components
references
💻 Claude Code Installation
Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: benchmark-runner
Download link: https://github.com/Mathews-Tom/praxis-skills/archive/main.zip#benchmark-runner
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.