MemoryPressure indicates a node is running critically low on available memory. The kubelet responds by tainting the node, stopping new pod scheduling, and evicting lower-priority pods to reclaim memory.
MemoryPressure is a node condition that indicates a Kubernetes node is running critically low on available memory. When the kubelet detects that available memory has dropped below its configured eviction threshold, it sets the MemoryPressure condition to True and applies the node.kubernetes.io/memory-pressure:NoSchedule taint to prevent new pods from being scheduled on that node. This condition triggers the kubelet's node-pressure eviction mechanism, which proactively terminates pods on the node to reclaim memory and prevent a system-level Out-Of-Memory (OOM) killer from becoming involved. The kubelet monitors memory continuously via cAdvisor and makes eviction decisions based on pod QoS class (Guaranteed, Burstable, or BestEffort), with BestEffort pods being evicted first. MemoryPressure can arise from legitimate high workloads, but often indicates resource overcommitment, memory leaks in applications, or insufficient node capacity for the scheduled pods. If eviction cannot reclaim memory quickly enough, the Linux kernel's OOM killer may still be invoked.
Confirm the node is actually under memory pressure:
kubectl describe node <NODE_NAME>
Look for the Conditions section and check whether MemoryPressure is True. Also inspect the Taints section to see if node.kubernetes.io/memory-pressure:NoSchedule is present.
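To check every node at once, a jsonpath query can report the MemoryPressure condition across the cluster:
# Report the MemoryPressure condition status for all nodes
kubectl get nodes -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.conditions[?(@.type=="MemoryPressure")].status}{"\n"}{end}'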
Examine current memory consumption to understand the scope:
# View node resource usage
kubectl top nodes
# View pod memory usage on specific node
kubectl top pods --all-namespaces --sort-by=memory
# Start a debug pod on the node (or SSH in) and check the kernel's view of memory
kubectl debug node/<NODE_NAME> -it --image=ubuntu
# Inside debug container:
cat /proc/meminfo
free -h
Note: 'kubectl top' uses kubelet cgroup stats (the authoritative source for eviction decisions), while 'free -h' may show different values.
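You can also query the kubelet's own stats summary through the API server proxy to see the exact numbers it works with (jq is assumed to be available for filtering):
# Query the kubelet's stats summary for the node
kubectl get --raw "/api/v1/nodes/<NODE_NAME>/proxy/stats/summary" | jq '.node.memory'
# Fields such as availableBytes and workingSetBytes reflect the kubelet's view of memory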
Review pods on the affected node and identify which can be safely removed:
# List all pods on specific node
kubectl get pods --all-namespaces --field-selector spec.nodeName=<NODE_NAME>
# Check pod resource requests/limits
kubectl describe pod <POD_NAME> -n <NAMESPACE> | grep -A 5 'Limits\|Requests'
# Identify BestEffort pods (no resource requests/limits - evicted first)
kubectl get pods --all-namespaces -o custom-columns=NAME:.metadata.name,NAMESPACE:.metadata.namespace,QOS:.status.qosClass --field-selector spec.nodeName=<NODE_NAME>
Pods with BestEffort QoS will be evicted first. Consider scaling down non-critical workloads, as shown below.
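A quick tally by QoS class shows how many pods are first in line for eviction; the scale command below uses a hypothetical deployment name:
# Count pods on the node by QoS class (BestEffort pods are evicted first)
kubectl get pods --all-namespaces --field-selector spec.nodeName=<NODE_NAME> -o jsonpath='{range .items[*]}{.status.qosClass}{"\n"}{end}' | sort | uniq -c
# Scale down a non-critical workload to relieve pressure immediately
kubectl scale deployment <NON_CRITICAL_DEPLOYMENT> -n <NAMESPACE> --replicas=0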
Set kubelet eviction thresholds to reclaim memory before the system OOM killer engages. Edit the kubelet configuration:
# In /etc/kubernetes/kubelet-config.yaml
evictionHard:
  memory.available: "500Mi"
evictionSoft:
  memory.available: "1.5Gi"
evictionSoftGracePeriod:
  memory.available: "1m30s"
systemReserved:
  memory: "1.5Gi"
kubeReserved:
  memory: "100Mi"
Restart the kubelet after changes (see the sketch below). Set soft thresholds comfortably above the hard threshold (here 1.5Gi soft vs 500Mi hard) so the kubelet has time to evict pods gracefully before the hard limit is reached.
Ensure pods declare realistic resource requests/limits:
apiVersion: v1
kind: Pod
metadata:
  name: my-app
spec:
  containers:
  - name: app
    image: my-app:1.0
    resources:
      requests:
        memory: "256Mi"
      limits:
        memory: "512Mi"
Run your application under load to measure realistic memory usage, then set requests to roughly 80% of peak observed usage. Consider using the Vertical Pod Autoscaler (VPA) to analyze historical usage, as in the sketch below.
If memory pressure is persistent despite tuning, increase cluster capacity:
# Scale up node pool (cloud provider specific)
# GKE example:
gcloud container node-pools update <POOL_NAME> \
  --machine-type n1-standard-4 \
  --cluster <CLUSTER_NAME>
Alternatively, distribute pods across more nodes using pod anti-affinity:
affinity:
  podAntiAffinity:
    preferredDuringSchedulingIgnoredDuringExecution:
    - weight: 100
      podAffinityTerm:
        labelSelector:
          matchExpressions:
          - key: app
            operator: In
            values:
            - my-app
        topologyKey: kubernetes.io/hostname
The kubelet's node-pressure eviction mechanism is designed to prevent system-level OOM events by proactively terminating pods. However, if memory spikes faster than the kubelet can detect it (cAdvisor gathers stats at a housekeeping interval, roughly 10s by default), the kernel OOM killer may still be invoked. Enable the kubelet's --kernel-memcg-notification flag (kernelMemcgNotification in the kubelet config file) so the kubelet receives an immediate memcg notification instead of waiting for the next polling cycle, as sketched below.
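A minimal sketch of enabling that notification, assuming a systemd-managed kubelet that reads the config file path used earlier in this guide:
# Append the KubeletConfiguration field and restart the kubelet
echo 'kernelMemcgNotification: true' | sudo tee -a /etc/kubernetes/kubelet-config.yaml
sudo systemctl restart kubelet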
When ranking pods for eviction, the kubelet considers whether a pod's memory usage exceeds its requests, the pod's priority, and its usage relative to requests. In practice this means BestEffort pods are evicted first, followed by Burstable pods that exceed their requests, with Guaranteed pods evicted last. Pods with a QoS class other than BestEffort automatically receive a toleration for the memory-pressure taint.
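You can confirm that toleration on any non-BestEffort pod:
# Show the memory-pressure toleration added by the control plane
kubectl get pod <POD_NAME> -n <NAMESPACE> -o jsonpath='{.spec.tolerations[?(@.key=="node.kubernetes.io/memory-pressure")]}'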
Hard eviction thresholds use a 0-second termination grace period (pods are killed immediately), while soft thresholds respect the kubelet's eviction-max-pod-grace-period setting. The kubelet does NOT respect PodDisruptionBudgets during node-pressure eviction; pods are evicted regardless of budget constraints.
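To see what has already been evicted (evicted pods remain in the Failed phase until deleted), something like the following works:
# List eviction events and the evicted pods left behind
kubectl get events --all-namespaces --field-selector reason=Evicted
kubectl get pods --all-namespaces --field-selector status.phase=Failed -o custom-columns=NAME:.metadata.name,NAMESPACE:.metadata.namespace,REASON:.status.reason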
The kubelet calculates memory.available from cgroup stats, not from host-level tools. This can show different values than free -m because the kubelet excludes inactive_file (reclaimable file cache) from its working-set calculation, treating that memory as available, as the sketch below illustrates.
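A rough approximation of that calculation, assuming cgroup v1 paths (cgroup v2 nodes expose the equivalent counters in /sys/fs/cgroup/memory.stat), run from a shell on the node:
# Approximate the kubelet's memory.available: capacity minus working set,
# where the working set excludes inactive_file (reclaimable page cache)
memory_capacity_kb=$(grep MemTotal /proc/meminfo | awk '{print $2}')
memory_capacity_bytes=$((memory_capacity_kb * 1024))
memory_usage_bytes=$(cat /sys/fs/cgroup/memory/memory.usage_in_bytes)
memory_total_inactive_file=$(grep total_inactive_file /sys/fs/cgroup/memory/memory.stat | awk '{print $2}')
memory_working_set_bytes=$((memory_usage_bytes - memory_total_inactive_file))
memory_available_bytes=$((memory_capacity_bytes - memory_working_set_bytes))
echo "memory.available: $((memory_available_bytes / 1024 / 1024)) Mi"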