Acme Status — status.acme.com

Acme Cloud · status.acme.com · all times UTC

Partial outage

Elevated error rates on Webhooks API

Started Apr 24 · 14:32 UTC · Affected: Webhooks API · INC-2026-0184
Monitoring · 15:12 UTC
Backlog draining.

Queue depth has fallen from 240k to 41k. Delivery latency is recovering. We will continue to watch for the next 30 minutes before resolving.

Identified · 14:48 UTC
Root cause identified.

A misconfigured retry policy was producing duplicate sends and saturating the worker pool. The configuration has been rolled back and throughput is recovering.

Investigating · 14:32 UTC
Elevated 5xx on Webhooks API.

We are seeing increased error rates on the webhook delivery service. The dashboard and read API are not affected.

Scheduled maintenance · Database failover drill

Apr 27 · 02:00 – 03:00 UTC · 60-minute window · Read API will be in read-only mode

Component status

API services

Partial outage
Public REST API · api.acme.com
90-day uptime · 99.974%
Operational
Webhooks API · webhooks.acme.com
90-day uptime · 99.62%
Partial outage
GraphQL API · graph.acme.com
90-day uptime · 99.998%
Operational

Web applications

All operational
Dashboard · app.acme.com
90-day uptime · 99.99%
Operational
Marketing site · www.acme.com
90-day uptime · 100%
Operational

Background services

All operational
Event processing · internal
90-day uptime · 99.99%
Operational
Search index · internal
90-day uptime · 100%
Operational
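
For readers who want to translate the 90-day uptime figures above into time, here is a minimal sketch of the arithmetic. It assumes uptime is measured over a flat 90 × 24 × 60 minute window with no exclusions for scheduled maintenance; this page does not state how the figures are computed, and the function name `downtime_minutes` is illustrative only.

```python
def downtime_minutes(uptime_percent: float, window_days: int = 90) -> float:
    """Return the minutes of downtime implied by an uptime percentage.

    Assumption: the window is a flat `window_days` * 24 * 60 minutes,
    with no exclusions for maintenance windows.
    """
    total_minutes = window_days * 24 * 60
    return total_minutes * (100.0 - uptime_percent) / 100.0


if __name__ == "__main__":
    # Percentages copied from the component list above.
    components = [
        ("Public REST API", 99.974),
        ("Webhooks API", 99.62),
        ("GraphQL API", 99.998),
    ]
    for name, pct in components:
        print(f"{name}: about {downtime_minutes(pct):.1f} min over 90 days")
```

Under that assumption, 99.62% for the Webhooks API works out to roughly 492 minutes (about 8.2 hours) over the 90-day window.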

Get told when something changes

One email per incident. Unsubscribe in one click. Webhook and Slack also available.

Channels: email · webhook · Slack · SMS (major only)

Recent history · last 14 days

Apr 18 · Identified: Brief 503s on Public REST · 15 min · Public REST API · resolved · Partial outage
Apr 15 · Scheduled: Postgres failover drill · 60 min · Dashboard, Public REST · completed · Maintenance
Apr 09 · Degraded search response times · 22 min · Search index · resolved · Degraded
