DaemonSet pods fail readiness checks due to liveness/readiness probe misconfiguration, application startup delays, or resource constraints. Fix by adjusting probe parameters, increasing initialDelaySeconds, or fixing the underlying application issue.
A DaemonSet pod that's Running but not Ready means:
1. The pod's container is actually executing
2. The readiness probe is failing (a failing liveness probe would instead trigger a container restart)
3. The kubelet marks the pod as NotReady
For DaemonSets, this is particularly problematic because:
- Traffic doesn't get routed to the pod (if it backs a Service)
- Dependent components treat the pod as broken
- Orchestration tools see the DaemonSet as unhealthy
Unlike Deployments, which can over-provision replicas, DaemonSets run exactly one pod per eligible node, so a NotReady pod leaves a gap in coverage on that node.
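To confirm this is what's happening, read the pod's Ready condition and recent probe events (the jsonpath filter is a sketch; kubectl describe shows the same information):
# Ready condition, including the reason it is False:
kubectl get pod -n <namespace> <pod-name> -o jsonpath='{.status.conditions[?(@.type=="Ready")]}'
# Probe failures show up as Unhealthy events:
kubectl describe pod -n <namespace> <pod-name> | grep -i "unhealthy\|probe"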
View the current probe configuration:
# Get DaemonSet YAML:
kubectl get daemonset -n <namespace> <name> -o yaml
# Look for readinessProbe and livenessProbe:
kubectl get daemonset -n <namespace> <name> -o yaml | grep -A 10 "readinessProbe\|livenessProbe"
# Check exact probe settings:
# - initialDelaySeconds
# - timeoutSeconds
# - periodSeconds
# - failureThreshold
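To pull just the probe object in one shot (a jsonpath sketch that assumes the probe lives on the first container):
kubectl get daemonset -n <namespace> <name> \
  -o jsonpath='{.spec.template.spec.containers[0].readinessProbe}'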
Examine what's happening when the probe fails:
# Check pod logs:
kubectl logs -n <namespace> <pod-name>
# Check probe endpoint directly:
kubectl exec -n <namespace> <pod-name> -- curl -v http://localhost:8080/health
# Check application startup time:
kubectl logs -n <namespace> <pod-name> | head -20
# Look for how long the app takes to become ready
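One way to measure that startup time (a sketch; the "ready"-style log pattern depends entirely on your application):
# Pod start time:
kubectl get pod -n <namespace> <pod-name> -o jsonpath='{.status.startTime}'
# First log line that looks like a readiness announcement:
kubectl logs -n <namespace> <pod-name> --timestamps | grep -i -m 1 "listening\|started\|ready"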
Give the application more time to start:
kubectl edit daemonset -n <namespace> <name>
Update the readiness probe:
readinessProbe:
  httpGet:
    path: /health
    port: 8080
  initialDelaySeconds: 60   # Increased from 10
  timeoutSeconds: 5
  periodSeconds: 10
  failureThreshold: 3
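If you'd rather not open an editor, the same change can be made with a JSON patch (a sketch; the path assumes the readiness probe sits on the first container in the template):
kubectl patch daemonset -n <namespace> <name> --type=json \
  -p='[{"op": "replace", "path": "/spec/template/spec/containers/0/readinessProbe/initialDelaySeconds", "value": 60}]'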
New pods will apply these settings:
# Force rollout:
kubectl rollout restart daemonset -n <namespace> <name>
Allow more time for the probe endpoint to respond:
readinessProbe:
  httpGet:
    path: /health
    port: 8080
  initialDelaySeconds: 30
  timeoutSeconds: 10   # Increased from 1
  periodSeconds: 10
  failureThreshold: 5   # Increased from 3
This gives the app 10 seconds to respond to each probe, and requires 5 consecutive failures before the pod is marked NotReady.
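To pick a sensible timeoutSeconds, measure how long the endpoint actually takes to answer (a sketch; assumes curl is available in the container):
kubectl exec -n <namespace> <pod-name> -- \
  curl -s -o /dev/null -w '%{time_total}s\n' http://localhost:8080/health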
Verify the probe endpoint works:
# Get pod IP:
kubectl get pod -n <namespace> <pod-name> -o jsonpath='{.status.podIP}'
# Test endpoint from another pod:
kubectl run debug --rm -it --image=alpine -- sh
# Inside pod:
apk add curl
curl -v http://<pod-ip>:8080/health
# Or exec directly:
kubectl exec -n <namespace> <pod-name> -- curl -v http://localhost:8080/health
# Check the response code - it should be 200
If the endpoint returns non-200, the application isn't actually ready.
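As a one-shot alternative to the interactive alpine pod, a throwaway curl pod can probe the endpoint directly (a sketch using the curlimages/curl image; substitute the pod IP from the step above):
kubectl run curl-test --rm -it --restart=Never --image=curlimages/curl -- \
  curl -v http://<pod-ip>:8080/health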
Verify the pod has enough resources:
# Check resource limits:
kubectl describe pod -n <namespace> <pod-name> | grep -A 3 "Limits\|Requests"
# Check if pod is hitting limits:
kubectl top pod -n <namespace> <pod-name>
# Check node for resource pressure:
kubectl describe node <node-name> | grep -A 5 "Allocated resources\|Conditions"
# If resources are tight, increase limits:
kubectl edit daemonset -n <namespace> <name>
Insufficient CPU/memory can slow startup significantly.
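For example, raise the container's requests and limits along these lines (illustrative values only; base them on what kubectl top actually reports for your workload):
resources:
  requests:
    cpu: 100m
    memory: 128Mi
  limits:
    cpu: 500m
    memory: 512Mi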
If the application itself is broken:
# Check application logs for errors:
kubectl logs -n <namespace> <pod-name> --previous # If pod crashed and restarted
# Common issues:
# - Missing environment variables
# - Database connection failures
# - Configuration loading errors
# - Port already in use
# Set/verify environment variables:
kubectl get daemonset -n <namespace> <name> -o yaml | grep -A 10 env:
# Check if the port is already in use (an ss variant for minimal images follows below):
kubectl exec -n <namespace> <pod-name> -- netstat -tlnp | grep 8080
Fix the application issues, then redeploy.
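Many minimal images ship without netstat; ss is the usual drop-in replacement (assuming it exists in the container):
kubectl exec -n <namespace> <pod-name> -- ss -tlnp | grep 8080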
After making changes, confirm the fix:
# Watch pod transitions:
kubectl get pods -n <namespace> -w | grep <pod-name>
# Check if pod becomes Ready:
kubectl get pods -n <namespace> <pod-name> -o wide
# Monitor probe results in events:
kubectl describe pod -n <namespace> <pod-name> | tail -20
# Once ready, verify all DaemonSet pods are ready:
kubectl get daemonset -n <namespace> <name>
# READY column should show N/N, where N is the number of nodes
### Probe Types and Defaults
Kubernetes supports three probe types:
- httpGet: HTTP request (most common for web apps)
- tcpSocket: TCP connection check
- exec: Run command in container
# httpGet example:
readinessProbe:
  httpGet:
    path: /ready
    port: 8080
    scheme: HTTP
# tcpSocket example:
readinessProbe:
  tcpSocket:
    port: 3306
# exec example:
readinessProbe:
  exec:
    command:
    - /bin/sh
    - -c
    - redis-cli ping
### Liveness vs Readiness
- Liveness: Is the container still alive? The kubelet restarts the container if it fails
- Readiness: Can the container handle traffic? The pod is removed from Service endpoints if it fails
For DaemonSet pods, which usually don't sit behind a Service, readiness matters less for traffic routing than it does for Deployments, but rollout status and monitoring still depend on it.
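A container often carries both, with liveness catching hangs and readiness gating traffic (an illustrative sketch; the endpoints and port are placeholders):
livenessProbe:
  httpGet:
    path: /healthz
    port: 8080
  initialDelaySeconds: 30
  periodSeconds: 10
readinessProbe:
  httpGet:
    path: /ready
    port: 8080
  periodSeconds: 5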
### Debug Configuration
For debugging, temporarily disable readiness probe:
readinessProbe:
  httpGet:
    path: /health
    port: 8080
  failureThreshold: 999999   # Effectively never marks the pod NotReady
This keeps a pod that has become Ready from flapping back to NotReady while you debug the actual issue. Note the pod still needs one successful probe to become Ready in the first place; deleting the readinessProbe block entirely skips even that.
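Removing the probe can also be done without an editor (a sketch; the JSON-patch path assumes the probe is on the first container):
kubectl patch daemonset -n <namespace> <name> --type=json \
  -p='[{"op": "remove", "path": "/spec/template/spec/containers/0/readinessProbe"}]'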
### Startup Probe (Kubernetes 1.18+)
For very slow-starting applications:
startupProbe:
  httpGet:
    path: /health
    port: 8080
  failureThreshold: 30   # 30 * 10s = 5 minutes max startup
  periodSeconds: 10
readinessProbe:
  httpGet:
    path: /ready
    port: 8080
  periodSeconds: 5
The startup probe gates the others: liveness and readiness checks are held off until it succeeds, which gives the app time to start.
### Typical Probe Values
For different application types:
- Fast startup (1-5s): initialDelaySeconds=10, timeoutSeconds=1
- Moderate startup (10-30s): initialDelaySeconds=30, timeoutSeconds=5
- Slow startup (30-60s): initialDelaySeconds=60, timeoutSeconds=10
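As a concrete instance, the moderate-startup row maps onto a probe like this (values taken from the list above; path and port are placeholders):
readinessProbe:
  httpGet:
    path: /health
    port: 8080
  initialDelaySeconds: 30
  timeoutSeconds: 5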
### Connection Refused Errors
If probe gets "connection refused":
1. Pod might not be listening yet
2. Port in probe config might be wrong
3. Application might be listening only on a different interface (the check below shows how to tell)
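httpGet probes are sent by the kubelet to the pod's IP, so an app bound only to 127.0.0.1 refuses them even when curl inside the container succeeds. One way to check the bind address (assuming ss is present in the image; netstat -tln is the older equivalent):
kubectl exec -n <namespace> <pod-name> -- ss -tln
# Local Address 0.0.0.0:8080 or [::]:8080 means all interfaces (probes reach it);
# 127.0.0.1:8080 means loopback only (the kubelet's probe is refused)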