When the internet blinked, teams needed clarity fast.
On Oct 20, 2025, a major AWS us-east-1 outage rippled across SaaS, messaging, conferencing, and cloud providers—reminding us that incidents aren’t an if, they’re a when.
PagerDuty’s core incident notifications stayed resilient, and our infra + engineering teams took rapid, decisive action to mitigate downstream impact—so responders knew what was wrong and how to act.
Resilience isn’t luck. It’s preparation, architecture, and practice.
Read how we navigated the outage and what teams can learn:
When the Internet Blinked: How PagerDuty Stayed Resilient During the October 20 Outage
By Rukmini Reddy, Sr. VP of Engineering, PagerDuty