ml-ops
CommunityStreamline ML model lifecycle.
Software Engineering#monitoring#mlops#machine learning#model training#pytorch#model deployment#experiment tracking
AuthorRedBeret
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill addresses the complexities of managing the entire machine learning model lifecycle, from initial training to ongoing monitoring in production. It aims to reduce the manual effort and potential for error in MLOps workflows.
Core Features & Use Cases
- Model Training: Provides guidance on setting up basic training loops and fine-tuning pre-trained models.
- Experiment Tracking: Emphasizes the importance of logging hyperparameters, metrics, and artifacts for reproducibility.
- Model Versioning & Deployment: Covers saving model checkpoints and outlines various serving options like FastAPI and TorchServe.
- Monitoring: Highlights the need to track data drift, model drift, latency, and error rates.
- Use Case: When developing a new image classification model, use this Skill to set up experiment tracking with MLflow, fine-tune a BERT model using HuggingFace's
Trainer, and deploy it as a FastAPI service.
Quick Start
Use the ml-ops skill to train a PyTorch model with experiment tracking using MLflow.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: ml-ops Download link: https://github.com/RedBeret/agent-skill-catalog/archive/main.zip#ml-ops Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.