gke-workload-scaling

Official

Scale GKE workloads with autoscaling.

AuthorGoogleCloudPlatform
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Specific workflows for scaling GKE workloads using manual scaling, Horizontal Pod Autoscaling (HPA), and Vertical Pod Autoscaling (VPA), along with best-practice guidelines for autoscaling configuration.

Core Features & Use Cases

  • Manual Scaling: quickly scale a deployment to a fixed number of replicas for immediate intervention or testing.
  • Horizontal Pod Autoscaling (HPA): auto-scale the number of pods based on observed CPU/memory or custom metrics, with a manifest-based approach using assets/hpa-example.yaml.
  • Vertical Pod Autoscaling (VPA): automatically adjust CPU and memory reservations, with templates in assets/vpa-example.yaml and recommended update modes.
  • Cluster Autoscaler considerations: ensure the node pool scales to support scaled workloads and optimize resource usage.

Quick Start

Follow the workflows to manually scale deployments, enable HPA with the provided manifest, and configure VPA using the included examples.

Dependency Matrix

Required Modules

None required

Components

assets

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: gke-workload-scaling
Download link: https://github.com/GoogleCloudPlatform/gke-mcp/archive/main.zip#gke-workload-scaling

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 223,000+ vetted skills library on demand.