marlowe-slurm-operator

Community

Operate Marlowe HPC cluster with Slurm.

AuthorTianyuDu
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill simplifies the complex process of interacting with the Stanford Marlowe HPC cluster, ensuring users can submit, monitor, and manage their jobs effectively and compliantly.

Core Features & Use Cases

  • Cluster State Verification: Discovers and verifies live cluster facts before proposing commands, preventing guesswork.
  • Safe Job Submission: Guides users through sbatch, salloc, and srun with Marlowe-specific account and partition requirements.
  • Monitoring & Diagnosis: Helps track job status, diagnose pending reasons, and review finished jobs using Slurm commands.
  • GPU-Hour Tracking: Provides guidance on monitoring GPU-hour consumption for relevant projects.
  • Use Case: A researcher needs to submit a GPU-accelerated job on the Marlowe cluster. They can use this Skill to ensure they are using the correct account suffix, partition, and loading the necessary modules, then submit the job safely and monitor its progress.

Quick Start

Use the marlowe-slurm-operator skill to verify the current state of the 'preempt' partition on the Marlowe cluster.

Dependency Matrix

Required Modules

None required

Components

referencestemplates

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: marlowe-slurm-operator
Download link: https://github.com/TianyuDu/SLURM-HPC-AGENT-SKILL/archive/main.zip#marlowe-slurm-operator

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.