Skill Explorer

Analyze evaluation logs for samples and metrics.

bytbroadley

nix-eval

Evaluate and inspect Nix expressions

byekala-project

read-eval-logs

View and analyze eval logs with Python.

byUKGovernmentBEIS

extract-eval-properties

Port AI evals to Inspect with feasibility

byEquiStamp

view-results

Analyze Hawk evaluation results.

byMETR

browser-use

Automate real-browser testing locally.

byNicktheQuickFTW

smalltalk

Interact with live Smalltalk images via MCP.

byCorporateSmalltalkConsultingLtd

hugging-face-evaluation

Streamline Hugging Face model evaluation outputs.

bycpich3g

create-inspect-task

Automate AI evaluation task creation with guided workflows.

byniznik-dev

debugging

Debug Nix evaluation and build issues.

byDaRacci

emacs-introspection

Discover Emacs functions, keys, and modes.

byjingtaozf

hugging-face-evaluation

Track model-card evaluations with ease and reliability.