rust-gpu-scheduling
OfficialEfficient GPU scheduling for ML pipelines
Software Engineering#dependencies#scheduling#gpu#memory-management#ml-pipelines#priority-queuing#model-lifecycle
Authorgar-ai
Version1.0.0
Installs0
System Documentation
What problem does it solve?
VRAM-aware GPU scheduling for ML model orchestration, balancing memory budgets with workload demands to prevent OOM and reduce idle time.
Core Features & Use Cases
- VRAM-aware scheduling with a safety margin to minimize OOM risks across multiple models (Whisper, VideoMAE, CLAP, Qwen3, DINOv3).
- Work item queues with priority handling and dependency tracking to enforce correct execution order.
- Model lifecycle management: track loaded/unloaded state and estimated load times to reduce costly reloads.
- VRAM budgeting, per-model accounting, and simple semaphore-based limiting to cap concurrent workloads.
- Guidelines and extensible architecture to integrate into ML orchestration pipelines.
Quick Start
Instantiate the GPUScheduler with your VRAM configuration and submit your first WorkItem to begin scheduling.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: rust-gpu-scheduling Download link: https://github.com/gar-ai/mallorn/archive/main.zip#rust-gpu-scheduling Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.