model-deployment-patterns

Community

Deploy ML models with confidence.

AuthorHermeticOrmus
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides battle-tested patterns and configurations for deploying machine learning models efficiently and reliably into production environments.

Core Features & Use Cases

  • Production Serving: Implement robust FastAPI services with health checks and metrics.
  • Triton Deployment: Configure model repositories for NVIDIA Triton Inference Server.
  • ONNX Export & Validation: Convert PyTorch models to ONNX and verify numerical equivalence.
  • Canary Deployments: Manage traffic splitting for phased rollouts using Kubernetes and Istio.
  • Load Testing: Use Locust to simulate production traffic and validate performance.

Quick Start

Use the model-deployment-patterns skill to export a PyTorch model to ONNX format.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: model-deployment-patterns
Download link: https://github.com/HermeticOrmus/LibreMLOps-Claude-Code/archive/main.zip#model-deployment-patterns

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.