rust-gpu-scheduling

Official

Efficient GPU scheduling for ML pipelines

Authorgar-ai
Version1.0.0
Installs0

System Documentation

What problem does it solve?

VRAM-aware GPU scheduling for ML model orchestration, balancing memory budgets with workload demands to prevent OOM and reduce idle time.

Core Features & Use Cases

  • VRAM-aware scheduling with a safety margin to minimize OOM risks across multiple models (Whisper, VideoMAE, CLAP, Qwen3, DINOv3).
  • Work item queues with priority handling and dependency tracking to enforce correct execution order.
  • Model lifecycle management: track loaded/unloaded state and estimated load times to reduce costly reloads.
  • VRAM budgeting, per-model accounting, and simple semaphore-based limiting to cap concurrent workloads.
  • Guidelines and extensible architecture to integrate into ML orchestration pipelines.

Quick Start

Instantiate the GPUScheduler with your VRAM configuration and submit your first WorkItem to begin scheduling.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: rust-gpu-scheduling
Download link: https://github.com/gar-ai/mallorn/archive/main.zip#rust-gpu-scheduling

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.