The Unplanned Show, Ep. 29: Major Incident Management with Davis and Chris

Not all incidents are created equal. How do you handle major incidents so that they don't spiral into a chaotic mess, incinerating productivity across too many teams? How do you prevent major incidents and learn from the ones you've had? "Major Incident Management" has been a practice for a long time, but as companies depend even more on digital services and revenue channels, while trying to do more with the same or less, something has to change.

How Complyt is using Datadog APM and distributed tracing to reduce application response times

Learn how Complyt is using Datadog Application Performance Monitoring (APM) and distributed tracing to turn data into knowledge and reduce application response times by more than 80%, which enabled them to meet SLAs for their largest customers.

How do you build resilient systems to manage the IPL with 30+ million concurrent users?

The Indian Premier League is a unique sporting event for many reasons, and for engineers in India it is one of a kind. Very few companies can claim to have managed 30+ million concurrent users, and the number keeps growing: last year, we witnessed ~60 million concurrent users.

Deliver Better Customer Experiences with PagerDuty for Customer Service

Want to deliver better customer experiences and meet your SLAs? PagerDuty for Customer Service Operations helps organizations connect the right teams at the right time, address urgent tickets, efficiently scale their 24/7 customer support model, and enhance cross-functional collaboration.