How to fix Error response from daemon: rpc error: code = ResourceExhausted desc = grpc: received message larger than max in Docker


Docker Swarm returns 'rpc error: code = ResourceExhausted desc = grpc: received message larger than max' when internal gRPC messages exceed the 4MB default limit. This typically occurs in large clusters with many services, secrets, or configs.

What this error means

This error occurs when Docker Swarm's internal communication exceeds gRPC's default message size limit. Docker Swarm uses gRPC, an open-source RPC framework originally developed at Google, for communication between manager nodes and for cluster state synchronization. The default maximum message size is 4,194,304 bytes (4 MiB). When the Swarm cluster state grows large enough (due to many services, secrets, configs, or nodes), the gRPC messages exchanged between managers can exceed this limit.

The error message typically looks like "grpc: received message larger than max (5351376 vs. 4194304)", where the first number is the actual message size in bytes and the second is the limit. The error is particularly common in large production clusters with hundreds of services, or in clusters that use Docker secrets and configs extensively. It can also appear when a manager node joins and the cluster snapshot being transferred is large.
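
To see how far over the limit a given message was, the two sizes can be pulled out of the error text. The following is a minimal sketch; `parse_grpc_overflow` is a hypothetical helper name, not a Docker command, and it assumes the message contains the "(<actual> vs. <limit>)" pair described above.

```shell
# parse_grpc_overflow MESSAGE: extract the actual size and the limit from a
# gRPC ResourceExhausted message and report the overshoot in bytes.
parse_grpc_overflow() {
  # Keep only "<actual> <limit>" from the "(<actual> vs. <limit>)" fragment.
  pair=$(printf '%s\n' "$1" | sed -n 's/.*(\([0-9][0-9]*\) vs\. \([0-9][0-9]*\)).*/\1 \2/p')
  if [ -z "$pair" ]; then
    echo "no size information found" >&2
    return 1
  fi
  actual=${pair% *}
  limit=${pair#* }
  echo "actual=$actual limit=$limit over_by=$((actual - limit))"
}

parse_grpc_overflow 'grpc: received message larger than max (5351376 vs. 4194304)'
# prints: actual=5351376 limit=4194304 over_by=1157072
```

A small overshoot suggests that cleanup alone may bring the state back under the limit, while a large one points toward an upgrade.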

How to fix "Error response from daemon: rpc error: code = ResourceExhausted desc = grpc: received message larger than max"

1. Check Docker Engine version

First, verify you're running a Docker version with increased gRPC limits. Docker 18.09.1+ includes fixes for this issue:

bash

docker version

If the reported Server version is older than 18.09.1, upgrading Docker is the recommended solution, because later releases increased the gRPC message size limits for most Swarm operations.
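
The version comparison can be scripted rather than read by eye. This is a sketch under a few assumptions: `version_ge` is a hypothetical helper, it relies on `sort -V` (available in GNU coreutils and BusyBox), and the 18.09.1 threshold is simply the version named in this article. Version strings with suffixes such as `-ce` may need to be trimmed first.

```shell
# version_ge A B: succeed when version A is greater than or equal to version B.
version_ge() {
  # After a version sort, B comes first exactly when B <= A.
  [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Usage against a live daemon (assumes the docker CLI is on PATH):
#   server="$(docker version --format '{{.Server.Version}}')"
#   if version_ge "$server" 18.09.1; then echo "ok"; else echo "upgrade recommended"; fi
```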

2. Upgrade Docker Engine

If you're on an older version, upgrade to the latest stable Docker release:

Ubuntu/Debian:

bash

sudo apt-get update
sudo apt-get install docker-ce docker-ce-cli containerd.io

CentOS/RHEL:

bash

sudo yum update docker-ce docker-ce-cli containerd.io

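Or use the convenience script (Docker's documentation positions it for quick installs on test machines and notes that it is not designed to upgrade an existing installation, so prefer the package manager on production nodes):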

bash

curl -fsSL https://get.docker.com | sh

After upgrading, restart Docker and rejoin the Swarm if necessary.
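
Because every node runs its own engine, it can help to confirm that no node was left behind after the upgrade. The sketch below filters a list of "hostname version" lines; `outdated_nodes` is a hypothetical helper, it assumes `sort -V` is available, and the usage line assumes it runs on a manager node.

```shell
# outdated_nodes MIN: read "hostname version" lines on stdin and print the
# nodes whose engine version sorts below MIN.
outdated_nodes() {
  min="$1"
  while read -r host ver; do
    [ -n "$ver" ] || continue
    lowest=$(printf '%s\n%s\n' "$ver" "$min" | sort -V | head -n 1)
    # A node is outdated when its version sorts first and differs from MIN.
    if [ "$lowest" = "$ver" ] && [ "$ver" != "$min" ]; then
      echo "$host $ver"
    fi
  done
}

# Usage on a manager node (assumes the docker CLI is on PATH):
#   docker node ls --format '{{.Hostname}} {{.EngineVersion}}' | outdated_nodes 18.09.1
```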

3. Reduce cluster state size

If upgrading isn't immediately possible, reduce the cluster state by cleaning up unused resources:

bash

# Remove unused services
docker service ls
docker service rm <unused_service>

# Remove unused secrets
docker secret ls
docker secret rm <unused_secret>

# Remove unused configs
docker config ls
docker config rm <unused_config>

# Remove unused networks
docker network prune

# Remove stopped containers and unused images on this node (mainly frees local disk space)
docker system prune -a

Pay special attention to services with restart policies that may have accumulated many task histories.
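
Before and after a cleanup, a quick count of stored objects gives a rough sense of how much the cluster state has shrunk. This is only a sketch: `swarm_object_counts` is a hypothetical helper, it must run on a manager node, and object counts are a proxy for state size rather than a direct measurement.

```shell
# swarm_object_counts: print how many services, secrets, and configs the
# cluster currently stores, one "kind=count" line each.
swarm_object_counts() {
  for kind in service secret config; do
    # -q prints one ID per line, so the line count is the object count.
    count=$(docker "$kind" ls -q | wc -l | tr -d ' ')
    echo "$kind=$count"
  done
}

# Usage on a manager node:
#   swarm_object_counts
```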

4. Check and reduce service task history

Docker keeps history of service tasks which contributes to cluster state size. Reduce the task history limit:

bash

# Check current task history limit
docker info | grep "Task History"

# Update global task history retention (default is 5)
docker swarm update --task-history-limit 2
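
The check and the update can be combined so that the limit is only ever lowered, never raised by accident. A minimal sketch, assuming it runs on a manager node and that `docker info` prints a "Task History Retention Limit" line there; `lower_task_history` is a hypothetical helper name.

```shell
# lower_task_history TARGET: reduce the Swarm task history limit to TARGET,
# but only when the current limit is higher.
lower_task_history() {
  target="$1"
  current=$(docker info 2>/dev/null | awk -F': *' '/Task History Retention Limit/ { print $2 }')
  if [ -z "$current" ]; then
    echo "could not read the task history limit (is this a Swarm manager?)" >&2
    return 1
  fi
  if [ "$current" -gt "$target" ]; then
    docker swarm update --task-history-limit "$target"
  else
    echo "limit already $current, nothing to do"
  fi
}

# Usage on a manager node:
#   lower_task_history 2
```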

For existing services with large histories, you can recreate them:

bash

# Export service definition
docker service inspect <service_name> > service-backup.json

# Remove and recreate (drops task history and stops the service's tasks until it is recreated)
docker service rm <service_name>
docker service create --name <service_name> <other_options>

5. Optimize service definitions

Large service definitions contribute to cluster state. Optimize them:

yaml

# Instead of inline configs/secrets in compose files
# Use external configs that reference existing objects
configs:
  my_config:
    external: true

secrets:
  my_secret:
    external: true

Avoid putting large files directly in Docker configs. Instead, mount them as volumes or bake them into images.

For services with many environment variables, consider using env files baked into the image rather than Swarm-level environment configuration.
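
One way to keep large files out of Swarm configs is to check the file size before creating the config. The sketch below is illustrative only: `check_config_size` is a hypothetical helper, and the 100 KiB default threshold is an arbitrary value chosen for the example, not a limit enforced by Docker.

```shell
# check_config_size FILE [MAX_BYTES]: fail when FILE is larger than MAX_BYTES.
check_config_size() {
  file="$1"
  max="${2:-102400}"   # illustrative default threshold (100 KiB)
  size=$(wc -c < "$file" | tr -d ' ')
  if [ "$size" -gt "$max" ]; then
    echo "$file is $size bytes (over $max); consider a volume or an image layer" >&2
    return 1
  fi
  echo "$file is $size bytes (within $max)"
}

# Usage (file and config names are placeholders):
#   check_config_size ./app.conf && docker config create app_conf ./app.conf
```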

6. Rebuild the Swarm cluster (last resort)

If the cluster state is too corrupted or large to recover, you may need to rebuild:

bash

# On each worker node
docker swarm leave

# On manager nodes (except one)
docker swarm leave --force

# On the last manager
docker swarm leave --force

# Reinitialize on the primary manager
docker swarm init --advertise-addr <MANAGER_IP>

# Print the manager join command, then run it on each remaining manager
docker swarm join-token manager

# Print the worker join command, then run it on each worker
docker swarm join-token worker

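Warning: This will remove all services, secrets, and configs. Export service definitions and config contents first, keep the original source files for your secrets (Swarm does not expose secret values once they are stored), and redeploy everything after reinitializing.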
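
Exporting the service definitions before the rebuild can be scripted. This is a sketch, assuming it runs on a manager node while the cluster still responds; `export_service_specs` and the backup directory name are hypothetical. The saved JSON is a reference for recreating services, not something the CLI can re-import directly.

```shell
# export_service_specs DIR: save each service's inspect output to DIR/<name>.json.
export_service_specs() {
  dir="$1"
  mkdir -p "$dir" || return 1
  docker service ls --format '{{.Name}}' | while read -r name; do
    [ -n "$name" ] || continue
    # </dev/null keeps the inner command from consuming the list of names.
    docker service inspect "$name" < /dev/null > "$dir/$name.json"
    echo "saved $dir/$name.json"
  done
}

# Usage on a manager node:
#   export_service_specs ./swarm-backup
```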
