This error occurs when a Docker Swarm cluster loses quorum because too few manager nodes are available. The Raft consensus algorithm requires a majority of managers to be online to elect a leader. Recovery typically involves reinitializing the swarm with --force-new-cluster on a surviving manager.
The "swarm does not have a leader" error indicates that your Docker Swarm cluster has lost quorum and cannot perform any management operations. Docker Swarm uses the Raft consensus algorithm to maintain a consistent cluster state across all manager nodes. This algorithm requires a strict majority (more than half) of manager nodes to be available to elect a leader and make decisions. When the cluster loses its leader (due to network partitions, node failures, or maintenance), the remaining managers attempt to hold an election. If there aren't enough managers available to form a majority, no leader can be elected, and the swarm becomes unable to process any management commands. For example, in a 3-manager cluster, you need at least 2 managers online. If 2 managers go down, the remaining 1 manager (which is only 33% of the cluster) cannot achieve quorum. Similarly, a 5-manager cluster can tolerate 2 failures but not 3. While the swarm is leaderless, existing services and containers continue running on worker nodes. However, you cannot deploy new services, update existing ones, add or remove nodes, or perform any other management tasks until quorum is restored.
First, determine which manager nodes are still accessible. On any node that can still reach the cluster, run:
```bash
docker node ls
```
This may fail with the "no leader" error. If so, check the individual node status:
```bash
docker info | grep -A 20 "Swarm:"
```
Look for:
- Is Manager: true
- Managers: X
- Nodes: Y
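If you prefer a single command, docker info's Go template can pull the same fields directly (field names per the Docker engine's Swarm info structure; the manager and node counts are only populated on manager nodes):
```bash
# Print whether this node is a manager, plus the manager/node counts
docker info --format 'IsManager: {{.Swarm.ControlAvailable}}, Managers: {{.Swarm.Managers}}, Nodes: {{.Swarm.Nodes}}'
```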
Check if the Docker daemon is running on all manager nodes:
```bash
# On each manager node
systemctl status docker
docker info
```
List the Raft state directory to see if data exists:
```bash
ls -la /var/lib/docker/swarm/raft/
```
Manager nodes must be able to communicate on specific ports. Test connectivity from each manager:
```bash
# Test cluster management port (TCP 2377)
nc -zv <other_manager_ip> 2377

# Test node communication port (TCP/UDP 7946)
nc -zv <other_manager_ip> 7946

# Test overlay network traffic (UDP 4789)
nc -zvu <other_manager_ip> 4789
```
Check firewall rules:
```bash
# For iptables
sudo iptables -L -n | grep -E "2377|7946|4789"

# For firewalld
sudo firewall-cmd --list-all
```
Ensure these ports are open between all manager nodes (an example of opening them with firewalld follows the list):
- 2377/tcp: Cluster management
- 7946/tcp+udp: Node communication
- 4789/udp: Overlay network traffic
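If any of these are blocked, open them. A firewalld example (adjust to whatever firewall tooling your hosts use):
```bash
# Open the swarm ports with firewalld, then reload the rules
sudo firewall-cmd --permanent --add-port=2377/tcp
sudo firewall-cmd --permanent --add-port=7946/tcp
sudo firewall-cmd --permanent --add-port=7946/udp
sudo firewall-cmd --permanent --add-port=4789/udp
sudo firewall-cmd --reload
```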
If manager nodes are simply down (not lost), bring them back online:
```bash
# On each offline manager
sudo systemctl start docker

# Wait for the node to rejoin
sleep 30

# Check if leader is elected
docker node ls
```
If managers are stuck, try restarting Docker:
```bash
sudo systemctl restart docker
```
Check Docker logs for errors:
```bash
sudo journalctl -u docker -f
```
Look for messages about Raft elections, connectivity issues, or leader election timeouts.
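A filtered view is often easier to scan than the live stream. The keywords below are illustrative, since the exact log wording varies across Docker versions:
```bash
# Search the last hour of daemon logs for election-related messages
sudo journalctl -u docker --since "1 hour ago" --no-pager | grep -iE 'raft|leader|election|quorum'
```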
Important: If you have 3 managers and only 1 is healthy, you cannot recover quorum this way. You need at least 2 of 3 managers to achieve majority.
If you cannot bring enough managers back online, reinitialize the swarm from a surviving manager with intact Raft data:
```bash
# On the manager with the most recent data
docker swarm init --force-new-cluster --advertise-addr <manager_ip>
```
This command:
- Creates a new single-node swarm using existing Raft data
- Preserves service definitions, secrets, and configs
- Maintains worker node registrations (they'll reconnect)
- Makes this node the new leader
After forcing a new cluster:
```bash
# Verify the new swarm is working
docker node ls

# Check services are intact
docker service ls

# Get the new join tokens
docker swarm join-token manager
docker swarm join-token worker
```
Warning: Only use --force-new-cluster when you cannot restore quorum normally. Running it on multiple nodes simultaneously can cause split-brain scenarios.
After recovering, clean up failed nodes and restore redundancy:
```bash
# List all nodes
docker node ls

# Remove unreachable manager nodes
docker node demote <failed_node_id>
docker node rm --force <failed_node_id>
```
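If several managers failed at once, a quick filter can collect the IDs to feed into the demote/remove commands above (a sketch using the standard docker node ls format placeholders):
```bash
# Print the IDs of all nodes currently reported as Down
docker node ls --format '{{.ID}} {{.Hostname}} {{.Status}}' | awk '$3 == "Down" {print $1}'
```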
Add new manager nodes to restore fault tolerance:
```bash
# On the leader, get the manager join token
docker swarm join-token manager

# On the new manager node
docker swarm join --token <manager_token> <leader_ip>:2377
```
Promote existing workers to managers if needed:
```bash
docker node promote <worker_node_id>
```
Verify the cluster is healthy:
```bash
docker node ls
# Look for one "Leader" and multiple "Reachable" manager statuses
```
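A narrower view can make this check easier, using docker node ls's built-in format placeholders:
```bash
# Show only the columns relevant to quorum health
docker node ls --format 'table {{.Hostname}}\t{{.Status}}\t{{.ManagerStatus}}'
```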
Prevent future quorum loss by following Docker's recommendations:
Use an odd number of managers (3, 5, or 7):
- 3 managers: Tolerates 1 failure
- 5 managers: Tolerates 2 failures
- 7 managers: Tolerates 3 failures (maximum recommended)
Never use 2 managers: if one fails, the surviving manager holds only 50% of the votes, which is not a majority.
Spread managers across failure domains:
```bash
# When adding managers, use different availability zones/racks
docker node update --label-add zone=us-east-1a manager1
docker node update --label-add zone=us-east-1b manager2
docker node update --label-add zone=us-east-1c manager3
```
Use static IPs for manager nodes to ensure reliable communication.
Configure manager-only nodes:
```bash
# Prevent workloads from running on managers
docker node update --availability drain <manager_node_id>
```
This dedicates managers to cluster management and improves Raft performance.
Implement proactive monitoring to detect quorum issues early:
Monitor manager health:
```bash
#!/bin/bash
# Check swarm health
if ! docker node ls &>/dev/null; then
    echo "CRITICAL: Swarm has no leader!"
    # Send alert here (mail, webhook, etc.)
    exit 1
fi

# Count managers, and how many are reachable per the MANAGER STATUS column
managers=$(docker node ls --filter role=manager -q 2>/dev/null | wc -l)
reachable=$(docker node ls --filter role=manager --format '{{.ManagerStatus}}' 2>/dev/null | grep -cE 'Leader|Reachable')
echo "Managers: $reachable/$managers reachable"
```
Backup swarm configuration regularly:
```bash
# Backup swarm data (run on a manager)
sudo tar -czvf swarm-backup-$(date +%Y%m%d).tar.gz /var/lib/docker/swarm/
```
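Should you ever need that backup, the usual restore sequence is roughly as follows (a sketch; the backup filename is hypothetical, and the archive must come from a manager of this same cluster):
```bash
# Stop Docker, restore the swarm directory, then recover as a single-node cluster
sudo systemctl stop docker
sudo rm -rf /var/lib/docker/swarm
sudo tar -xzvf swarm-backup-20240101.tar.gz -C /   # hypothetical backup file
sudo systemctl start docker
docker swarm init --force-new-cluster
```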
Use Docker's autolock feature for additional security:
```bash
docker swarm update --autolock=true
# Save the unlock key securely!
```
If autolock is enabled and a manager restarts, you'll need to unlock it:
```bash
docker swarm unlock
```
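If you lose track of the unlock key, any currently unlocked manager can display or rotate it:
```bash
# View the current unlock key
docker swarm unlock-key

# Rotate the key if it may have been exposed
docker swarm unlock-key --rotate
```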
Understanding Raft Consensus: Docker Swarm uses the Raft consensus algorithm to maintain a consistent replicated log across all manager nodes. Raft requires a majority (quorum) to make progress:
- 3 nodes: quorum = 2
- 5 nodes: quorum = 3
- 7 nodes: quorum = 4
The formula is: quorum = floor(n/2) + 1, where n is the total number of managers. For example, with n = 5, quorum = floor(5/2) + 1 = 3.
Why 7 Managers Maximum: While you can have more than 7 managers, it's not recommended. Each additional manager increases the time for consensus operations because a majority of managers must acknowledge every write before it is committed. Beyond 7, the performance overhead outweighs the fault-tolerance benefits.
Raft Log Recovery: If Raft logs become corrupted, you may need to:
```bash
# Stop Docker
sudo systemctl stop docker

# Remove corrupted Raft data (LAST RESORT - loses cluster state)
sudo rm -rf /var/lib/docker/swarm/raft/

# Reinitialize or rejoin
sudo systemctl start docker
docker swarm init   # or docker swarm join
```
Split-Brain Prevention: Never run --force-new-cluster on multiple nodes. If two managers each create a new cluster, you'll have two separate swarms with conflicting state.
Time Synchronization: Raft elections use timeouts. If clocks drift significantly between managers, elections can fail repeatedly. Ensure NTP is configured:
```bash
timedatectl status
sudo timedatectl set-ntp true
```
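A quick way to compare clocks across managers is to print each node's epoch time side by side (a sketch; the hostnames are hypothetical and SSH access between nodes is assumed):
```bash
# Print UTC epoch seconds on each manager; large differences indicate skew
for host in manager1 manager2 manager3; do
    printf '%s: ' "$host"
    ssh "$host" date -u +%s
done
```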
Debugging Raft Issues:
```bash
# Check the Raft configuration (the Raft settings live under .Swarm.Cluster.Spec
# in the engine's info structure; adjust if your Docker version differs)
docker info --format '{{json .Swarm.Cluster.Spec.Raft}}'

# View detailed swarm info
docker system info | grep -A 50 "Swarm"
```
Kubernetes Migration Consideration: If you frequently encounter swarm stability issues, consider migrating to Kubernetes, which has more sophisticated cluster management. However, for simpler deployments, a properly configured Swarm is often sufficient and easier to operate.