KubeletNotReady occurs when the kubelet service on a worker node becomes unhealthy or stops communicating with the control plane. This is often caused by container runtime failures, certificate issues, or systemd service problems.
The KubeletNotReady condition signals that the kubelet—the Kubernetes agent running on each node responsible for managing pods and containers—is either down, misconfigured, or unable to communicate with the API server. When this occurs, the control plane marks the node as 'NotReady' and stops scheduling new pods to that node. This error appears in the node status conditions and typically means the kubelet process is not running, the systemd service has failed, or the kubelet cannot establish a healthy connection to the control plane due to certificate validation failures, container runtime issues, or network problems. Underlying causes range from transient issues (restarting the kubelet fixes them immediately) to persistent configuration problems (cgroup driver mismatches, missing container runtime endpoints) and infrastructure issues (disk full, swap enabled, network isolation, certificate expiration).
Verify the node is actually NotReady and identify which conditions are reporting 'Unknown':
kubectl get nodes
kubectl describe node <node-name>

Look at the 'Conditions' section. If all conditions show 'Unknown', the kubelet is down. If specific conditions (MemoryPressure, DiskPressure) show 'True', you have a resource exhaustion issue. The 'Status' field and 'LastHeartbeatTime' tell you when the kubelet last communicated.
Connect directly to the affected worker node and inspect the kubelet systemd service:
ssh <node-ip>
sudo systemctl status kubelet
sudo journalctl -xu kubelet.service --no-pager | tail -50

The journalctl output shows the actual error preventing the kubelet from starting. Common errors include 'running with swap on is not supported', 'container runtime endpoint address was not specified', or 'misconfiguration: kubelet cgroup driver'.
Check if the container runtime (Docker or containerd) is running:
# For containerd
sudo systemctl status containerd
sudo systemctl restart containerd
# For Docker
sudo systemctl status docker
sudo systemctl restart docker

Ensure the kubelet configuration points to the correct runtime endpoint. If using containerd:
# Check kubelet extra args
cat /etc/systemd/system/kubelet.service.d/0-containerd.conf
# Should contain:
# Environment="KUBELET_EXTRA_ARGS=--container-runtime=remote --container-runtime-endpoint=unix:///run/containerd/containerd.sock"
# Note: on Kubernetes v1.27 and later the --container-runtime flag was removed;
# only --container-runtime-endpoint is needed.

Swap must be disabled for the kubelet to function. Permanently disable it:
# Immediately disable
sudo swapoff -a
# Permanently disable by editing fstab
sudo nano /etc/fstab
# Comment out any line containing 'swap'

If the kubelet and container runtime have mismatched cgroup drivers, edit the kubelet config:
sudo nano /etc/systemd/system/kubelet.service.d/10-kubeadm.conf
# Add: Environment="KUBELET_CGROUP_ARGS=--cgroup-driver=systemd"
# Then reload and restart
sudo systemctl daemon-reload
sudo systemctl restart kubelet

If the error mentions 'NetworkPluginNotReady' or /etc/cni/net.d is empty, install a CNI plugin:
# First check if CNI config exists
ls /etc/cni/net.d/
# If empty, apply a network plugin (example: Weave)
kubectl apply -f https://github.com/weaveworks/weave/releases/download/v2.8.1/weave-daemonset-k8s.yaml
# Or for Flannel
kubectl apply -f https://raw.githubusercontent.com/coreos/flannel/master/Documentation/kube-flannel.yml
# Restart kubelet and wait for CNI pod to initialize
sudo systemctl restart kubelet

Check that the kubelet client certificates are valid and trusted by the API server:
# Check certificate expiration
sudo openssl x509 -in /var/lib/kubelet/pki/kubelet-client-current.pem -text -noout | grep -A2 'Validity'

If certificates are expired or invalid, regenerate them (on the control plane):
sudo kubeadm certs renew all

Note that 'kubeadm certs renew all' renews the control-plane certificates (including the client certificates embedded in the kubeconfig files); the kubelet's own client certificate in /var/lib/kubelet/pki/ is normally rotated automatically, provided the kubelet can still authenticate.

Then restart the kubelet:
sudo systemctl daemon-reload
sudo systemctl restart kubelet
# Monitor startup
sudo journalctl -xu kubelet.service -f

The kubelet operates as a systemd service with configuration split across multiple files: the base unit file (/etc/systemd/system/kubelet.service), kubeadm drop-in configs (/etc/systemd/system/kubelet.service.d/), and the kubelet config file (/var/lib/kubelet/config.yaml). Changes to drop-in files require 'systemctl daemon-reload' before taking effect.
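A quick way to inspect this layered configuration, assuming a kubeadm-provisioned node with the default paths above:

```shell
# Show the base unit file plus every drop-in, merged in the order systemd applies them
systemctl cat kubelet

# List the drop-in directory (10-kubeadm.conf is the usual kubeadm drop-in)
ls /etc/systemd/system/kubelet.service.d/

# Inspect the KubeletConfiguration file referenced by the drop-in
sudo head -n 20 /var/lib/kubelet/config.yaml

# Any edit to a drop-in requires a daemon reload before the restart takes effect
sudo systemctl daemon-reload && sudo systemctl restart kubelet
```

This is the fastest way to confirm which drop-in is actually setting a given flag, since later drop-ins override earlier ones.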
The kubelet communicates with the API server using mutual TLS—the client certificate in /var/lib/kubelet/pki/ must be valid and signed by the cluster CA, while the kubeconfig at /etc/kubernetes/kubelet.conf must point to the correct CA file.
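To check that chain of trust directly, you can verify the client certificate against the cluster CA with openssl. The CA path below is the kubeadm default on control-plane nodes; on a worker, the CA is embedded in /etc/kubernetes/kubelet.conf instead:

```shell
# Verify the kubelet client certificate is signed by the cluster CA.
# Prints '<path>: OK' when the chain is valid; any error here explains
# TLS authentication failures against the API server.
sudo openssl verify -CAfile /etc/kubernetes/pki/ca.crt \
    /var/lib/kubelet/pki/kubelet-client-current.pem
```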
Cgroup driver selection (systemd vs cgroupfs) must match the container runtime's choice; on kubeadm clusters it is typically set via the cgroupDriver field of the KubeletConfiguration (the legacy kubelet flag is --cgroup-driver=systemd). On systems with cgroup v2, the systemd driver is the recommended choice.
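To check both sides of the pairing at once (paths assume kubeadm with containerd defaults; adjust if your runtime config lives elsewhere):

```shell
# Kubelet side: expect 'cgroupDriver: systemd' on recent kubeadm defaults
grep -i cgroupDriver /var/lib/kubelet/config.yaml

# containerd side: expect 'SystemdCgroup = true' under the runc runtime options
grep -i SystemdCgroup /etc/containerd/config.toml

# Kernel side: 'cgroup2fs' here means the node runs cgroup v2
stat -fc %T /sys/fs/cgroup
```

If the first two commands disagree (e.g. systemd vs cgroupfs), pods will fail to start even though both services report as running.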
The --fail-swap-on=false flag can temporarily bypass swap checks for troubleshooting, but swap should be permanently disabled before production use.
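As a sketch, the flag can be set through a systemd drop-in (the filename below is hypothetical, chosen only for this example):

```ini
# /etc/systemd/system/kubelet.service.d/20-swap-debug.conf (hypothetical name)
[Service]
Environment="KUBELET_EXTRA_ARGS=--fail-swap-on=false"
```

The equivalent KubeletConfiguration field in /var/lib/kubelet/config.yaml is failSwapOn: false. Either way, run 'sudo systemctl daemon-reload && sudo systemctl restart kubelet' afterwards, and remove the override once swap is properly disabled.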