The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Top 5 EdTech outages detected by StatusGator in July 2025

July 2025 saw several significant service disruptions affecting the education technology (EdTech) ecosystem. From online learning platforms to creative tools used by teachers and students, these outages caused widespread frustration. StatusGator monitored and detected these incidents, providing early alerts to help schools and organizations stay informed.

We built an MCP server so Claude can access your incidents

"Show me all critical incidents from the last week." "Create an incident for the payment API being down." "What was the root cause of that database incident last Tuesday?" If you've ever wished you could just ask Claude (or any MCP client) to handle incident management tasks instead of context-switching between chat and your incident management dashboard, you're going to like what we built.

EMEA Rundeck by PagerDuty Meetup - July 2025

Join us for an informal 1-hour virtual event where the open-source Rundeck by PagerDuty community comes together to share automation stories and use cases. Whether you're new to Rundeck or looking to elevate your automation game, this meetup is packed with valuable takeaways for everyone! Host: Martin Van Son, Automation Specialist & Strategic Solution Advisor at PagerDuty. Topics: New OSS Dashboards & Enterprise ROI Plugin + Creating Rundeck Plugins with Claude Code.

AMER Rundeck by PagerDuty Meetup - July 2025

Join us for an informal 1-hour virtual event where the open-source Rundeck by PagerDuty community comes together to share automation stories and use cases. Whether you're new to Rundeck or looking to elevate your automation game, this meetup is packed with valuable takeaways for everyone! Host: Forrest Evans, Director, Product Management at PagerDuty. Topic: Rundeck by PagerDuty: A Swiss Army Knife of Automation.

Incident Commander Role: Responsibilities and Best Practices

When a critical system goes down at 3 AM, the difference between a quick resolution and hours of costly downtime often comes down to one role: the incident commander. This person serves as the central coordinator during IT incidents, making crucial decisions that can save thousands of dollars per minute.

What Is a Rapid Response Team (RRT) in Hospitals? Why Do They Matter?

Imagine you’re working on a hospital floor when suddenly a patient’s condition starts to deteriorate. What happens next can mean the difference between life and death. That’s where a Rapid Response Team (RRT) steps in: a specially trained group of healthcare professionals who respond quickly to patients showing early signs of crisis to prevent emergencies like cardiac arrest or respiratory failure. But how common are these teams? What do they really do day-to-day?

EU AI Act: what changes in August 2025 and how to prepare

On August 2, 2025, a key part of the EU AI Act comes into force. It has serious implications for how you manage incidents related to artificial intelligence. While the full regulation will not apply until 2026, new obligations for providers of general-purpose AI (GPAI) models begin this summer. If you are building or deploying AI-powered services in Europe, the clock is ticking.

Why Monitoring Heartbeat Events with PagerDuty AIOps is the Future of System Health Tracking

Organizations migrating from Opsgenie and other legacy incident management platforms are discovering that basic connectivity monitoring isn’t enough for modern operations. While Opsgenie Heartbeats and similar traditional heartbeat features offer simple binary status checks of system availability, PagerDuty’s AIOps-powered approach transforms system health monitoring from reactive alerting into intelligent, automated operational intelligence.

PagerDuty Named a Leader and Outperformer in the 2025 GigaOm Radar for AIOps

There’s no shortage of hype around AI in operations, but recognition from a trusted source like GigaOm cuts through the noise. We are excited to share that PagerDuty earned a top spot as a Leader and Outperformer in the 2025 report. It’s recognition that reflects the progress we’ve made in delivering an AI-powered platform that actually helps teams move faster, reduce costs, and operate with confidence in complex environments.

10 Best Live Call Routing Software for Incident Management

I curated a list of the 10 best live call routing software tools for incident management. To compare them, I created a checklist of essential features, then read each tool’s documentation to see how it stacks up against that checklist. Finally, I summarized the results in three tables. If you are new to live call routing, I’ve included a section that covers the basics for you. Let’s get started!