My Horizontal Pod Autoscaler wasn’t scaling as expected. Here’s how I debugged it.
My debugging steps
When HPA isn’t scaling:
- kubectl get hpa - Check current/target metrics and replica count
- kubectl describe hpa my-app - Look at the Conditions and Events sections
- kubectl top pods - Verify metrics-server is working
- Check if pods have resource requests defined (HPA needs them)
The most common issue I see: pods without CPU/memory requests. HPA can’t calculate a utilisation percentage without knowing the baseline.
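As a sketch of what that baseline looks like, here is an illustrative Deployment fragment (names and values are examples, not from a real app) with the requests HPA needs:

```yaml
# Illustrative Deployment fragment: HPA computes utilisation
# against resources.requests, so set them on every container.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-app
spec:
  replicas: 2
  selector:
    matchLabels:
      app: my-app
  template:
    metadata:
      labels:
        app: my-app
    spec:
      containers:
        - name: my-app
          image: my-app:latest   # placeholder image
          resources:
            requests:
              cpu: 250m       # baseline: 70% target means ~175m per pod
              memory: 256Mi
            limits:
              cpu: 500m
              memory: 512Mi
```

With a 250m request, a CPU target of 70% means HPA adds replicas once average usage per pod passes roughly 175m.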
Basic commands
Check HPA status:
kubectl get hpa
Describe for more detail:
kubectl describe hpa my-app
Look for the Conditions section - it tells you why scaling isn’t happening.
Check all HPAs across namespaces:
kubectl get hpa --all-namespaces
Common issues
- Metrics not available - Check metrics-server is running
- Target not found - Deployment name mismatch
- Min = Max - Can’t scale if they’re equal
- No resource requests - HPA can’t calculate percentage without them
- Cooldown period - HPA waits before scaling (default 5 mins for scale-down)
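The scale-down cooldown is configurable in the autoscaling/v2 API via the behavior field. A minimal sketch, with example values and an illustrative name:

```yaml
# autoscaling/v2 HPA fragment: shortens the default 300s
# scale-down stabilization window. Values are examples.
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: my-app
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: my-app          # must match the Deployment name exactly
  minReplicas: 2          # min must be below max or nothing scales
  maxReplicas: 10
  behavior:
    scaleDown:
      stabilizationWindowSeconds: 120   # default is 300
      policies:
        - type: Pods
          value: 1        # remove at most one pod per minute
          periodSeconds: 60
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 70
```

This also covers two of the issues above: the scaleTargetRef name mismatch and min equal to max.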
Check metrics-server:
kubectl get pods -n kube-system | grep metrics
View current metrics:
kubectl top pods
For monitoring commands, see monitoring with watch and top.
Metrics I use for scaling
- CPU - Good for compute-bound workloads. Target 50-70% utilisation.
- Memory - Less useful for scaling (memory usage often stays high after load drops, so replicas never scale back down). Better for alerting.
- Custom metrics - Queue depth, request latency, connections. More accurate for I/O-bound services.
For web services, CPU at 70% target typically works well. For queue workers, queue depth via custom metrics is more accurate.
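A queue-depth HPA might look like the sketch below. This assumes a metrics adapter (e.g. prometheus-adapter) is exposing the metric; the metric name, target value, and workload names are all illustrative:

```yaml
# Pods-type custom metric: scale on average queue depth per pod.
# Requires a custom metrics adapter; names here are assumptions.
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: queue-worker
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: queue-worker
  minReplicas: 1
  maxReplicas: 20
  metrics:
    - type: Pods
      pods:
        metric:
          name: queue_depth        # assumed metric name from the adapter
        target:
          type: AverageValue
          averageValue: "30"       # e.g. target ~30 messages per pod
```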
Nuclear option
If HPA is stuck, sometimes deleting and recreating helps:
kubectl delete hpa my-app
kubectl apply -f hpa.yaml
Further reading
- HPA algorithm details - how scaling decisions are made