Skip to main content

When the internet blinked, teams needed clarity fast.

On Oct 20, 2025, a major AWS us-east-1 outage rippled across SaaS, messaging, conferencing, and cloud providers—reminding us that incidents aren’t an if, they’re a when.

PagerDuty’s core incident notifications stayed resilient, and our infra + engineering teams took rapid, decisive action to mitigate downstream impact—so responders knew what was wrong and how to act.

Resilience isn’t luck. It’s preparation, architecture, and practice.

 

Read how we navigated the outage and what teams can learn:
 

 

 

Be the first to reply!