gke-workload-scaling
OfficialScale GKE workloads with autoscaling.
AuthorGoogleCloudPlatform
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Specific workflows for scaling GKE workloads using manual scaling, Horizontal Pod Autoscaling (HPA), and Vertical Pod Autoscaling (VPA), along with best-practice guidelines for autoscaling configuration.
Core Features & Use Cases
- Manual Scaling: quickly scale a deployment to a fixed number of replicas for immediate intervention or testing.
- Horizontal Pod Autoscaling (HPA): auto-scale the number of pods based on observed CPU/memory or custom metrics, with a manifest-based approach using assets/hpa-example.yaml.
- Vertical Pod Autoscaling (VPA): automatically adjust CPU and memory reservations, with templates in assets/vpa-example.yaml and recommended update modes.
- Cluster Autoscaler considerations: ensure the node pool scales to support scaled workloads and optimize resource usage.
Quick Start
Follow the workflows to manually scale deployments, enable HPA with the provided manifest, and configure VPA using the included examples.
Dependency Matrix
Required Modules
None requiredComponents
assets
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: gke-workload-scaling Download link: https://github.com/GoogleCloudPlatform/gke-mcp/archive/main.zip#gke-workload-scaling Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.