experiment
Community
Run and monitor ML training experiments.
Author: ChuaHanChong
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill automates the execution, monitoring, and result parsing of individual machine learning training experiments, ensuring reliable and reproducible runs.
Core Features & Use Cases
- Experiment Execution: Runs training scripts with specified configurations on designated GPUs.
- Environment Setup: Manages code environments using git worktrees for isolated experiments.
- Monitoring & Error Handling: Includes pre-flight checks, early abort detection, and an auto-repair loop for common training failures.
- Use Case: When optimizing hyperparameters, this Skill is used to launch each individual training run with a unique set of parameters, logging its progress and outcome.
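The features above can be illustrated with a minimal sketch. It is not the Skill's actual implementation: the function names (`build_env`, `worktree_command`, `should_abort`, `run_experiment`), the `worktrees` directory, and the `loss=<value>` log format are all assumptions made for illustration. It shows how a run might be pinned to one GPU through `CUDA_VISIBLE_DEVICES`, isolated with `git worktree add`, and stopped early when the logged loss becomes NaN or infinite.

```python
import math
import os
import re
import subprocess
import sys

# Hypothetical log format: lines containing "loss=<value>" or "loss: <value>".
LOSS_RE = re.compile(
    r"loss\s*[=:]\s*([-+]?(?:nan|inf|\d+(?:\.\d*)?(?:e[-+]?\d+)?))",
    re.IGNORECASE,
)


def build_env(gpu_id, base_env=None):
    """Return a copy of the environment that exposes only one GPU to the child."""
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    return env


def worktree_command(exp_id, root="worktrees", ref="HEAD"):
    """Build the git command that creates an isolated checkout for one experiment."""
    return ["git", "worktree", "add", "--detach", os.path.join(root, exp_id), ref]


def should_abort(log_line):
    """Early-abort check: True if the line reports a NaN or infinite loss."""
    match = LOSS_RE.search(log_line)
    if match is None:
        return False
    value = float(match.group(1))
    return math.isnan(value) or math.isinf(value)


def run_experiment(cmd, gpu_id, cwd=None):
    """Run a training command, streaming its log and stopping on a diverged loss."""
    proc = subprocess.Popen(
        cmd,
        cwd=cwd,
        env=build_env(gpu_id),
        stdout=subprocess.PIPE,
        stderr=subprocess.STDOUT,
        text=True,
    )
    for line in proc.stdout:
        if should_abort(line):
            proc.terminate()
            proc.wait()
            return "aborted"
    return "ok" if proc.wait() == 0 else "failed"


if __name__ == "__main__":
    # Stand-in for a real training script: prints a single diverged loss line.
    fake_training = [sys.executable, "-c", "print('step 1 loss=nan')"]
    print(worktree_command("exp-001"))
    print(run_experiment(fake_training, gpu_id=0))
```

In a hyperparameter search, a caller would invoke something like `run_experiment` once per parameter set and record the returned status next to the configuration that produced it.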
Quick Start
Execute the training script for experiment 'exp-001' on GPU 0 with the provided configuration.
Dependency Matrix
Required Modules
torch, pyyaml
Components
scripts, references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: experiment Download link: https://github.com/ChuaHanChong/ml-optimizer/archive/main.zip#experiment Please download this .zip file, extract it, and install it in the .claude/skills/ directory.