Name: score-tasks
Availability: InStock
Author: sourcegraph

System Documentation

What problem does it solve?

This Skill addresses the need for consistent and objective evaluation of benchmark tasks, ensuring their clarity, verifiability, and reproducibility.

Core Features & Use Cases

Automated Quality Scoring: Assigns scores based on instruction clarity, verifier quality, and reproducibility.
Identification of Weaknesses: Flags tasks that fall below a specified quality threshold, highlighting areas for improvement.
Use Case: A benchmark curator can use this Skill to automatically assess a new set of tasks, ensuring they meet the required standards before being added to the benchmark suite.

Quick Start

Use the score-tasks skill to score all tasks in the csb_sdlc_pytorch suite and display the results in a table.

Please help me install this Skill: Name: score-tasks Download link: https://github.com/sourcegraph/CodeScaleBench/archive/main.zip#score-tasks Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

score-tasks

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper