When the internet blinked, teams needed clarity fast.
On Oct 20, 2025, a major AWS us-east-1 outage rippled across SaaS, messaging, conferencing, and cloud providers—reminding us that incidents aren’t an if, they’re a when.
PagerDuty’s core incident notifications stayed resilient, and our infra + engineering teams took rapid, decisive action to mitigate downstream impact—so responders knew what was wrong and how to act.
Resilience isn’t luck. It’s preparation, architecture, and practice.
Read how we navigated the outage and what teams can learn:
When the Internet Blinked: How PagerDuty Stayed Resilient During the October 20 Outage
By Rukmini Reddy, Sr. VP of Engineering, PagerDuty