How to fix ACME challenge failed in Kubernetes


The "ACME challenge failed" error occurs when cert-manager cannot prove domain ownership to Let's Encrypt during TLS certificate provisioning. It blocks certificate issuance and renewal, which can leave services without a valid certificate for HTTPS. Common causes include DNS propagation delays, an unreachable HTTP challenge endpoint, and Let's Encrypt rate limits.

What this error means

This error indicates that cert-manager, running in your Kubernetes cluster with a Let's Encrypt issuer, could not complete an ACME challenge. ACME (Automatic Certificate Management Environment) requires proof of domain ownership before a TLS certificate is issued. Until the challenge succeeds, no certificate is created or renewed, so new hosts cannot serve trusted HTTPS and existing certificates will eventually expire. The failure can occur during DNS validation (DNS-01), HTTP validation (HTTP-01), or cert-manager's own self-check, which runs before Let's Encrypt is asked to validate.

How to fix "ACME challenge failed"

1. Debug the challenge failure

First, inspect the challenge and order objects to understand why validation failed:

bash

kubectl describe challenge
kubectl describe order

Look for error messages indicating DNS or HTTP validation failures. The output will show the specific challenge type (dns01 or http01) and any error messages from the validation attempt.
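A Challenge is the last link in the chain of resources cert-manager creates for an ACME issuance (Certificate, CertificateRequest, Order, Challenge), so it can help to list them together before describing a single object. The following is a minimal sketch, assuming kubectl is configured for the cluster and the cert-manager CRDs are installed; the function names acme_chain and challenge_reasons are illustrative helpers, not part of cert-manager.

```shell
# Hypothetical helper: list every resource in the ACME issuance chain
# for one namespace (defaults to "default").
acme_chain() {
  local ns="${1:-default}"
  kubectl get certificate,certificaterequest,order,challenge -n "$ns" -o wide
}

# Hypothetical helper: print name, state and reason for each Challenge,
# one per line, using the Challenge status fields.
challenge_reasons() {
  local ns="${1:-default}"
  kubectl get challenge -n "$ns" \
    -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.status.state}{"\t"}{.status.reason}{"\n"}{end}'
}
```

For example, `acme_chain my-namespace` followed by `challenge_reasons my-namespace` narrows the search to the one Challenge worth describing in full.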

2. Verify Let's Encrypt can reach your endpoints

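For HTTP-01 challenges, check from a machine outside the cluster that the challenge URL is reachable over plain HTTP on port 80, which is where Let's Encrypt starts its validation request: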

bash

curl -I http://yourdomain.com/.well-known/acme-challenge/test-token
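The same check can be scripted so that it prints only the status code. This is a rough sketch, assuming curl is installed and the command runs from a host outside the cluster; challenge_url and http_status are hypothetical helper names. Redirects are followed because the Let's Encrypt validator also follows them.

```shell
# Build the URL an ACME server requests for an HTTP-01 challenge.
challenge_url() {
  local domain="$1" token="$2"
  printf 'http://%s/.well-known/acme-challenge/%s\n' "$domain" "$token"
}

# Print only the final HTTP status code for a URL, following redirects.
# curl prints 000 when no HTTP response was received (timeout, refused).
http_status() {
  curl -s -o /dev/null -L --max-time 10 -w '%{http_code}' "$1"
}
```

A call such as `http_status "$(challenge_url yourdomain.com test-token)"` with a made-up token will most likely return 404, which still shows that port 80 is reachable; a result of 000 suggests a firewall, DNS, or load balancer problem. The real token for a pending challenge appears in the spec of the Challenge object.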

For DNS-01 challenges, verify TXT records are visible externally:

bash

dig _acme-challenge.yourdomain.com TXT @8.8.8.8
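Propagation problems often show up as one resolver returning the record while another does not, so comparing several resolvers can be more telling than a single query. The sketch below assumes dig is installed; acme_txt_check is a hypothetical helper, and the default resolver list is only an illustrative choice.

```shell
# Hypothetical helper: query the _acme-challenge TXT record for a domain
# against each resolver given (or three public ones by default) and print
# one tab-separated line per resolver.
acme_txt_check() {
  local domain="$1"
  shift
  if [ "$#" -eq 0 ]; then
    set -- 8.8.8.8 1.1.1.1 9.9.9.9
  fi
  local r answer
  for r in "$@"; do
    answer="$(dig +short TXT "_acme-challenge.${domain}" "@${r}")"
    printf '%s\t%s\n' "$r" "${answer:-<no TXT record>}"
  done
}
```

Running `acme_txt_check yourdomain.com` prints one line per resolver; differing answers usually mean the record has not finished propagating.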

3. Fix HTTP-01 challenge issues

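If you use HTTP-01 and cert-manager's self-check cannot reach the challenge URL from inside the cluster, check the externalTrafficPolicy of the ingress controller's LoadBalancer Service. With Local, traffic that originates inside the cluster may not be routed to the ingress pods; Cluster avoids this at the cost of losing the client source IP: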

yaml

apiVersion: v1
kind: Service
metadata:
  name: ingress-nginx
spec:
  externalTrafficPolicy: Cluster

Also verify that client certificate authentication (mTLS) is not enforced on the /.well-known/acme-challenge/ path, since the ACME server presents no client certificate.

4. Fix DNS-01 challenge issues

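For DNS-01 challenges, confirm that the issuer's solver points at valid DNS provider credentials. The example uses Cloudflare; the email address and the account key Secret name are placeholders:

yaml

apiVersion: cert-manager.io/v1
kind: ClusterIssuer
metadata:
  name: letsencrypt-prod
spec:
  acme:
    server: https://acme-v02.api.letsencrypt.org/directory
    email: admin@example.com  # placeholder contact address
    privateKeySecretRef:
      name: letsencrypt-prod-account-key  # placeholder; stores the ACME account key
    solvers:
    - dns01:
        cloudflare:
          apiTokenSecretRef:
            name: cloudflare-api
            key: token

If the TXT record exists but cert-manager's propagation self-check keeps failing, for example with split-horizon DNS, you can point the self-check at specific recursive resolvers with the cert-manager controller flags --dns01-recursive-nameservers and --dns01-recursive-nameservers-only.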

Verify DNS provider credentials are correct and API access is enabled.
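One credential problem that is easy to miss: for a ClusterIssuer, cert-manager looks up referenced Secrets in its cluster resource namespace, which defaults to cert-manager, so a Secret created in an application namespace is not found. The following sketch checks that a Secret exists and contains the expected key; secret_has_key is a hypothetical helper, and it assumes kubectl access to that namespace.

```shell
# Hypothetical helper: succeed if Secret <name> in namespace <ns> has a
# non-empty value under <key>. Keys containing dots would need escaping
# in the jsonpath expression; simple keys such as "token" work as-is.
secret_has_key() {
  local ns="$1" name="$2" key="$3" value
  value="$(kubectl get secret "$name" -n "$ns" -o "jsonpath={.data.${key}}" 2>/dev/null)"
  [ -n "$value" ]
}
```

With the issuer shown in this step, the check would be `secret_has_key cert-manager cloudflare-api token && echo "secret ok"`.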

5. Handle rate limiting

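If you see 429 errors, you have hit a Let's Encrypt rate limit. Test your configuration against the staging environment first; it has much higher limits but issues certificates that browsers do not trust. The account key Secret name below is a placeholder:

yaml

apiVersion: cert-manager.io/v1
kind: ClusterIssuer
metadata:
  name: letsencrypt-staging
spec:
  acme:
    server: https://acme-staging-v02.api.letsencrypt.org/directory
    privateKeySecretRef:
      name: letsencrypt-staging-account-key  # placeholder Secret name
    # solvers: reuse the solver configuration from your production issuer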

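Once staging issuance succeeds, switch back to the production issuer. How long you must wait depends on which limit you hit: the failed-validation limit is counted per hour, while limits on issued certificates use longer windows. The error message usually states when you can retry.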
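To tell a rate limit apart from an ordinary validation failure, you can search the describe output for the ACME error type that RFC 8555 defines for rate limiting. A minimal sketch; is_rate_limited is a hypothetical helper that reads text on standard input.

```shell
# Hypothetical helper: succeed if the text on stdin mentions the ACME
# "rateLimited" error type (RFC 8555) or a standalone HTTP 429 status.
is_rate_limited() {
  grep -Eq 'urn:ietf:params:acme:error:rateLimited|(^|[^0-9])429([^0-9]|$)'
}
```

For example: `kubectl describe order | is_rate_limited && echo "rate limited, switch to staging"`.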

6. Enable temporary certificates

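If your ingress controller will not serve the host until the TLS Secret exists, which can show up as TLS handshake failures, ask cert-manager to issue a temporary self-signed certificate. This is controlled by an annotation on the Certificate rather than a spec field (the secretName below is a placeholder):

yaml

apiVersion: cert-manager.io/v1
kind: Certificate
metadata:
  name: example-com
  annotations:
    cert-manager.io/issue-temporary-certificate: "true"
spec:
  secretName: example-com-tls  # placeholder; Secret that will hold the certificate
  dnsNames:
  - example.com
  issuerRef:
    name: letsencrypt-prod
    kind: ClusterIssuer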

This allows the ingress to start with a temporary cert while the ACME challenge completes.

7. Use in-place HTTP challenge resolution

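If your ingress controller cannot serve several Ingress resources for the same hostname, for example because it assigns a separate IP address to each Ingress, add this annotation to the existing Ingress: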

yaml

apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  annotations:
    acme.cert-manager.io/http01-edit-in-place: "true"

This allows cert-manager to modify your existing ingress for the challenge rather than creating a new one.
