Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

The Shift from Reactive to Proactive Incident Management: What AI Actually Makes Possible

Why enterprise operations teams stop chasing incidents and start preventing them Most enterprise operations teams are faster than they were three years ago. Alert routing is automated. On-call schedules are managed through platforms rather than spreadsheets. MTTR has come down as tooling has improved. On the metrics that measure reactive performance, progress is visible. What has not meaningfully changed is the rate at which the same incidents recur.

MCP Apps: On Call Compensation Report and Service Dependency Graph

This April, PagerDuty's MCP server expands with powerful new capabilities across Analytics & Reporting and Business Services. Teams can now surface aggregate incident data, service metrics, and team metrics — giving operators instant access to the operational insights that matter most. On the Business Services side, the release adds business service dependencies, subscriber management, impacted services analysis, and priority mapping. Rounding out the release are two new MCP Apps (on our experimental branch): Service Dependency graph. and an On-call Compensation report.

Why post-mortem action items die

You can run the best debrief of your life. Honest timeline, blameless tone, real insights. People leave the room nodding. And then nothing happens. This is the last mile problem of post-mortems - and it's an easy trap to fall into. When you've just been through a stressful incident, getting it back up is the priority. Once it's over, the post-mortem itself can feel like the finish line. You've documented what happened, been honest about it, identified what went wrong. It feels like the work is done.

In the Age of AI, Taste Isn't About Aesthetics

AI can generate a UI in seconds. So what do designers actually bring to the table? Marcela, Principal Product Designer at Rootly and former Founding Designer at Ramp, has spent 20 years in design. Her answer: taste isn't about aesthetics or crafting pleasant interactions. It's about asking the uncomfortable questions, and choosing the right problem, not the easiest one.

PagerDuty Invests in the AI-First Operations and Resilience of Healthcare and Crisis Response Organizations

At PagerDuty, we believe operational excellence and social impact are inseparable. As AI rapidly transforms how nonprofits operate, our AI and agentic technology empower mission-driven teams to automate complexity and focus their limited resources on what matters most: delivering reliable services that create meaningful impact at scale.

Why IncidentHub's Alerting is Better than Other Status Page Aggregators'

IncidentHub tracked 48000 SaaS and Cloud outages in 2025. The average organization depends on 100+ SaaS apps, making third-party vendor monitoring a crucial aspect of risk management and business continuity for almost all modern organizations. Better SaaS outage alerting is about monitoring the right parts of your third-party services, and routing alerts to the right people at the right time.

SIGNL4 Update: Stakeholder Communication and Signl Status Notifications

When incidents happen, they rarely stay contained. Customers, partners, and internal stakeholders are often affected – but too often, they’re informed late or not at all. In critical situations, that lack of communication can quickly turn into real business risk. With our latest SIGNL4 release, we’re changing that.

Incident Response Is Broken Without Stakeholders in the Loop

Yet status pages are not enough for modern incident communication. In incident response, the conversation has traditionally centered on speed and resolution – how quickly teams can detect, escalate, and fix issues. But in practice, incidents don’t exist in a vacuum. They ripple outward, affecting customers, executives, partners, compliance teams, and even public perception. That broader circle – the stakeholders – is often underserved by conventional tooling.