#OpsLife


AMA Come ask our participants anything DevOps, incident response, or infrastructure-related! How-tos, best practices, their opinions on a particular way of doing things. You know you want to.
Topic Replies Activity
Virtually Vote in our Resource Guide Poll 1 September 24, 2019
has anyone user pager duty alert to restart windows server or windows service? 1 May 13, 2019
Is it correct to incorporate Pending status in tool for Incident Management? 1 June 14, 2019
Tips for not waking a SO while on-call 5 April 29, 2019
Video AMA: The Art and Science of Resiliency with John Allspaw
AMA
8 February 11, 2019
Postmortem Tips and Tricks 1 February 1, 2019
Video AMA: Chaos Engineering with Ana Medina
AMA
8 January 29, 2019
Happy Cybersecurity Month everyone! Some resources to share... 1 October 8, 2018
Video AMA: Postmortems and More with J. Paul Reed
AMA
15 September 11, 2018
Video AMA: Alice Goldfuss
AMA
10 August 2, 2018
Video AMA: Running the Infrastructure of Open Source with Ashley Williams
AMA
4 June 25, 2018
Video AMA: Humane On-Call with Jeff Smith
AMA
5 March 22, 2018
What topics do you follow the best blogs on? 4 February 6, 2018
John Allspaw article on post-mortems 1 December 19, 2017
Our first outage 1 August 23, 2017
How do you cut a monolith in half? 1 July 10, 2017
Don't Settle For Eventual Consistency 1 July 1, 2017
Things I Learned Managing Site Reliability for Some of the World’s Busiest Gambling Sites 1 June 26, 2017
On failure and resilience 1 June 23, 2017
AMA with authors of Zero-Trust Networks: August 17, 2017
AMA
27 August 17, 2017
Introducing tracing into existing metrics and logging infrastructure 2 June 16, 2017
AMA with Charity Majors - June 15, 2017
AMA
44 June 16, 2017
Unusual implementation: TicketDuty 3 June 15, 2017
About the #OpsLife category 3 March 12, 2019
"Why is it raining in the datacenter?" 1 May 15, 2017
PagerDuty Incident Response Documentation 2 May 8, 2017