A "retries exhausted" error occurs when Terraform or a provider SDK uses up all of its configured retry attempts while waiting for an API operation to succeed. It typically shows up with rate limiting, transient network errors, or API instability.
When a Terraform provider makes an API call, built-in retry logic handles transient failures (network blips, temporary API errors, rate limiting). Each provider has a maximum number of retries (25 by default for the AWS provider) and a maximum duration to keep retrying. When the operation fails repeatedly and reaches either the retry count limit or the retry timeout duration, Terraform reports "retries exhausted": the provider gave up because the error persisted despite multiple attempts.
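To make those two limits concrete, this is roughly how they map onto the knobs you can actually set, both of which are covered in detail in the fixes below (the values here are illustrative):

```hcl
provider "aws" {
  region      = "us-east-1"
  max_retries = 25 # the retry *count* limit for each API call
}

resource "aws_instance" "example" {
  ami           = "ami-0c55b159cbfafe1f0"
  instance_type = "t2.micro"

  timeouts {
    create = "15m" # the *duration* limit while waiting on the resource
  }
}
```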
Before making configuration changes, verify the cloud provider isn't experiencing an outage:
```
# AWS status page
https://health.aws.amazon.com/

# Azure status page
https://status.azure.com/

# Google Cloud status page
https://status.cloud.google.com/
```

If there's a reported incident, wait for the provider to resolve it. Re-run Terraform after the provider confirms resolution.
For AWS, increase the maximum retry attempts in your provider block:
provider "aws" {
region = "us-east-1"
max_retries = 25 # Default is 25, increase if needed
}For other providers, check their documentation for similar settings. Each provider may have different retry configuration options.
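Recent versions of the AWS provider also expose a retry_mode argument (check that your provider version supports it); the adaptive mode adds client-side rate limiting on top of the usual exponential backoff, which can help under sustained throttling:

```hcl
provider "aws" {
  region      = "us-east-1"
  max_retries = 40
  retry_mode  = "adaptive" # valid values are "standard" and "adaptive"
}
```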
If you're deploying many resources at once, reduce the number of concurrent operations:
```bash
# Default parallelism is 10, reduce it to 2-3
terraform apply -parallelism=3
```

This gives the cloud provider more time to handle each request before the next one arrives, reducing the chance of rate limiting.
Parallelism can't be set in the terraform configuration block itself, but you can set it persistently through an environment variable, which is useful in CI:

```bash
# Applies -parallelism=3 to every terraform apply in this shell
export TF_CLI_ARGS_apply="-parallelism=3"
```

Increase timeouts for resources that take longer to stabilize:
resource "aws_instance" "example" {
ami = "ami-0c55b159cbfafe1f0"
instance_type = "t2.micro"
timeouts {
create = "15m"
update = "15m"
delete = "10m"
}
}
resource "aws_rds_cluster" "example" {
cluster_identifier = "my-cluster"
timeouts {
create = "45m"
update = "60m"
delete = "30m"
}
}Giving resources more time reduces the likelihood of hitting retry exhaustion.
Run Terraform with debug logging to see what specific API error is triggering retries:
```bash
export TF_LOG=debug
terraform apply 2>&1 | tee terraform.log

# Or for maximum verbosity
export TF_LOG=trace
terraform apply 2>&1 | tee terraform.log
```

Search the log for HTTP status codes and actual error messages:
```bash
grep -i '429\|500\|503\|timeout\|throttl' terraform.log
```

Look for patterns: if you see 429 (rate limit), 503 (service unavailable), or timeout errors, that identifies the problem.
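On AWS, it can also help to search for the SDK's throttling error codes by name; the exact codes vary by service, but these are common ones:

```bash
# Common AWS throttling/limit error codes that appear in debug logs
grep -iE 'ThrottlingException|RequestLimitExceeded|TooManyRequestsException|Rate exceeded' terraform.log
```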
If the issue is transient (the provider had a brief hiccup), simply retry:

```bash
# Wait a minute or two, then retry
sleep 120
terraform apply
```

If the apply partially succeeded, you may need to refresh state first:
```bash
terraform refresh
terraform apply
```

On current Terraform versions, terraform apply -refresh-only is the preferred replacement for the deprecated terraform refresh command.

For CI/CD, implement a retry loop in your pipeline script:
```bash
for i in {1..3}; do
  terraform apply -auto-approve && exit 0
  echo "Attempt $i failed, waiting..."
  sleep 60
done
exit 1
```

The retry exhaustion behavior is provider-specific. AWS uses exponential backoff with a maximum of 25 retries by default, which roughly equates to 30-60 seconds of retries for network errors. However, for throttling (429) responses, the backoff is longer, and cumulative wait times can exceed 5 minutes. Some providers like Databricks or Azure may have different retry strategies.
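If you want the pipeline retry loop above to mirror this kind of exponential backoff instead of a fixed sleep, here is a minimal sketch (the delays are illustrative):

```bash
# Retry terraform apply with doubling delays: 60s, 120s, 240s
delay=60
for attempt in 1 2 3; do
  terraform apply -auto-approve && exit 0
  echo "Attempt $attempt failed; waiting ${delay}s before retrying..."
  sleep "$delay"
  delay=$((delay * 2))
done
exit 1
```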
When combined with StateChangeConf (the SDK helper that waits for a resource to reach its desired state), retry exhaustion becomes more complex: the provider might exhaust retries waiting for the API to respond, then exhaust state-change retries waiting for the resource to be provisioned. Each has its own timeout.
For AWS specifically, the hashicorp/aws-sdk-go-base wrapper reduces network error retries to 10 (roughly 30 seconds) to prevent extremely long hangs, while keeping higher retries for throttling. As a practitioner, you generally can't control this, but you can work around it by increasing resource-level timeouts and reducing parallelism.