This error occurs when Docker Swarm cannot schedule tasks on a node because the node's availability is set to 'drain' or 'pause', or because the node has gone offline. The fix involves checking node status, updating availability settings, or rejoining the node to the swarm.
The "node is not available" error in Docker Swarm indicates that the swarm manager cannot assign tasks to a particular node. This happens when a node's availability state prevents it from receiving new workloads. In Docker Swarm, every node has an availability setting that determines whether it can accept tasks: - **Active**: The node can receive and run tasks (default state when joining) - **Pause**: The node cannot receive new tasks, but existing tasks continue running - **Drain**: The scheduler stops all existing tasks and moves them to other active nodes; no new tasks are assigned This error commonly appears when you try to deploy a service with placement constraints targeting a node that is drained or paused, when a node has gone offline due to network issues, or when a manager node has been drained to dedicate it to management tasks only.
First, inspect the current state of all nodes in your swarm:
```bash
docker node ls
```

This shows the availability of each node. Look for nodes with "Drain" or "Pause" in the AVAILABILITY column:
```
ID                HOSTNAME   STATUS   AVAILABILITY   MANAGER STATUS
abc123def456...   manager1   Ready    Active         Leader
xyz789ghi012...   worker1    Ready    Drain
```

To get detailed information about a specific node:
```bash
docker node inspect <node_name> --pretty
```

Check the "Availability" field in the output.
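If you only need the availability value (for example in a script), the node's spec can be read directly; a minimal sketch using the inspect format template:

```bash
# Prints just the availability setting: active, pause, or drain
docker node inspect <node_name> --format '{{ .Spec.Availability }}'
```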
If the node shows "Drain" or "Pause" status and you want it to accept tasks, update its availability:
```bash
docker node update --availability active <node_name>
```

For example:
```bash
docker node update --availability active worker1
```

Verify the change:
```bash
docker node ls
```

The node should now show "Active" in the AVAILABILITY column. Existing services will automatically reschedule tasks to this node if needed.
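To confirm that tasks are actually being scheduled onto the reactivated node, you can list a service's tasks and the nodes they run on:

```bash
# Shows each task, its current node, and its state
docker service ps <service_name>
```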
If the node shows "Down" status instead of "Ready", it has lost connection to the swarm:
```bash
docker node ls
```

Look for:
```
ID                HOSTNAME   STATUS   AVAILABILITY   MANAGER STATUS
xyz789ghi012...   worker1    Down     Active
```

On the disconnected node, check if Docker is running:
```bash
sudo systemctl status docker
```

If Docker is stopped, start it:
```bash
sudo systemctl start docker
```

Check swarm membership on the node:
```bash
docker info | grep -A 5 "Swarm"
```

If the node thinks it's not in a swarm, you may need to rejoin it.
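For a script-friendly check, the swarm state can also be read with a format template; a small sketch:

```bash
# Prints "active" when the node is part of a swarm, "inactive" otherwise
docker info --format '{{ .Swarm.LocalNodeState }}'
```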
If a node has been disconnected or removed, rejoin it to the swarm.
On a manager node, get the join token:
```bash
# For worker nodes
docker swarm join-token worker

# For manager nodes
docker swarm join-token manager
```

This outputs a command like:
```bash
docker swarm join --token SWMTKN-1-abc123... 192.168.1.100:2377
```

On the node to rejoin, first leave any existing swarm:
```bash
docker swarm leave --force
```

Then run the join command provided by the manager:
```bash
docker swarm join --token SWMTKN-1-abc123... 192.168.1.100:2377
```

On the manager, remove the old node entry if it persists:
```bash
docker node rm <old_node_id>
```

Docker Swarm requires specific ports to be open between nodes:
- TCP 2377: Cluster management communications
- TCP/UDP 7946: Communication among nodes (gossip protocol)
- UDP 4789: Overlay network traffic (VXLAN)
Test connectivity from a worker to the manager:
```bash
nc -zv <manager_ip> 2377
nc -zv <manager_ip> 7946
nc -zuv <manager_ip> 4789
```

If using UFW (Ubuntu firewall):
```bash
sudo ufw allow 2377/tcp
sudo ufw allow 7946/tcp
sudo ufw allow 7946/udp
sudo ufw allow 4789/udp
sudo ufw reload
```

If using firewalld (RHEL/CentOS):
```bash
sudo firewall-cmd --permanent --add-port=2377/tcp
sudo firewall-cmd --permanent --add-port=7946/tcp
sudo firewall-cmd --permanent --add-port=7946/udp
sudo firewall-cmd --permanent --add-port=4789/udp
sudo firewall-cmd --reload
```

Check for HTTP proxy interference: If you have an HTTP proxy configured, it may intercept swarm traffic. Ensure Docker traffic bypasses the proxy.
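On systemd-based hosts, one common way to make the Docker daemon bypass the proxy for swarm traffic is a drop-in unit file. The sketch below uses placeholder values (proxy.example.com and the manager IP from the earlier example); adjust them for your environment:

```bash
# Create a drop-in that excludes swarm node addresses from the proxy
sudo mkdir -p /etc/systemd/system/docker.service.d
sudo tee /etc/systemd/system/docker.service.d/http-proxy.conf <<'EOF'
[Service]
Environment="HTTP_PROXY=http://proxy.example.com:3128"
Environment="HTTPS_PROXY=http://proxy.example.com:3128"
Environment="NO_PROXY=localhost,127.0.0.1,192.168.1.100"
EOF

# Reload systemd and restart Docker so the change takes effect
sudo systemctl daemon-reload
sudo systemctl restart docker
```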
If you're using placement constraints in your service definition, ensure they target available nodes.
Check existing service constraints:
```bash
docker service inspect <service_name> --pretty | grep -A 10 "Placement"
```

Common constraint issues:
1. Targeting a drained node by name:

```yaml
# This fails if worker1 is drained
deploy:
  placement:
    constraints:
      - node.hostname == worker1
```

2. Using labels that don't exist on active nodes:
```yaml
deploy:
  placement:
    constraints:
      - node.labels.region == us-west
```

Add labels to nodes:
```bash
docker node update --label-add region=us-west worker1
```

Verify node labels:
```bash
docker node inspect worker1 --format '{{ .Spec.Labels }}'
```

Update the service to remove or modify constraints:
```bash
docker service update --constraint-rm 'node.hostname == worker1' <service_name>
```

If a node was removed but tasks still reference it, clean up the stale entries.
List all nodes including down ones:
```bash
docker node ls
```

Remove nodes that are no longer part of the cluster:
```bash
docker node rm <node_id>
```

If the node is still showing tasks:
```bash
docker node rm --force <node_id>
```

Check for orphaned tasks:
```bash
docker service ps <service_name> --filter "desired-state=running"
```

If tasks are stuck referencing a removed node, force a service update:
```bash
docker service update --force <service_name>
```

This reschedules all tasks, placing them on available nodes.
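If several services are affected (for example after removing a dead node), they can all be force-updated in one pass. This is a sketch assuming a Linux manager with GNU xargs; note that forcing an update briefly restarts each service's tasks:

```bash
# Force-reschedule every service in the swarm
docker service ls -q | xargs -r -n1 docker service update --force
```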
Understanding node availability states:
- Active: Default state. The node participates fully in the swarm and can receive task assignments.
- Pause: The node remains in the swarm and existing tasks keep running, but no new tasks are assigned. Useful for troubleshooting without disrupting current workloads.
- Drain: The scheduler evacuates all tasks from the node and prevents new assignments. Use this for maintenance or to dedicate manager nodes to management only.
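For example, to take a node out of scheduling while troubleshooting it and then return it to normal duty (shown here with the example node worker1 used earlier):

```bash
# Stop new task assignments but keep existing tasks running
docker node update --availability pause worker1

# Resume normal scheduling when finished
docker node update --availability active worker1
```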
Best practice for manager nodes: In production swarms, drain your manager nodes so they focus solely on cluster management:
```bash
docker node update --availability drain <manager_node>
```

Handling node failures gracefully: Configure appropriate restart policies and replica counts so the swarm can recover when nodes fail:
```yaml
deploy:
  replicas: 3
  restart_policy:
    condition: on-failure
    delay: 5s
    max_attempts: 3
```

Monitoring node health: Set up monitoring to alert when nodes go down:
```bash
# Simple check script: prints any node that is not in the Ready state
docker node ls --format "{{.Hostname}}: {{.Status}}" | grep -v "Ready"
```

Soft constraints (workaround): Docker Swarm doesn't support "preferred" placement (like Kubernetes nodeAffinity). If a node is unavailable, services with hard constraints will fail. Consider using labels on multiple nodes to provide fallback options:
```bash
# Label multiple nodes with the same role
# (docker node update accepts one node at a time)
docker node update --label-add role=database worker1
docker node update --label-add role=database worker2
docker node update --label-add role=database worker3
```

Recovering from split-brain scenarios: If network partitions cause managers to disagree on cluster state, you may need to reinitialize the swarm. As a last resort:
```bash
docker swarm init --force-new-cluster
```

This recovers from a loss of quorum using the current manager's state. Use with caution.
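Before forcing a new cluster, it is worth backing up the swarm state directory so the previous Raft data can be restored if needed. A sketch assuming the default data root of /var/lib/docker; stop Docker first so the files are consistent:

```bash
# Back up the swarm state on the surviving manager
sudo systemctl stop docker
sudo tar -czf /tmp/swarm-backup-$(date +%F).tar.gz /var/lib/docker/swarm
sudo systemctl start docker
```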
Task history and stale references: Docker keeps a history of tasks (default: 5). If you see "node not found" errors in logs, these may relate to old task entries. Reduce history to minimize noise:
```bash
docker swarm update --task-history-limit 3
```
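To confirm the new limit took effect, the value appears in the Swarm section of `docker info` (the field name may vary slightly between Docker versions):

```bash
# Look for "Task History Retention Limit" in the output
docker info | grep -i "task history"
```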