Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

SRE agent vs. traditional engineer: 7 key differences

The role of a Site Reliability Engineer (SRE) is evolving. The focus has shifted from simply working harder during an outage; A new kind of teammate is here to help: the SRE Agent. But what are the key differences when you compare an SRE agent versus a traditional site reliability engineer? This isn’t just a superficial change. It signifies a fundamental alteration in how teams construct and sustain dependable services.

Do Hospitals Still Use Pagers in 2026? Why They're Not Secure (And What's Replacing Them)

Are hospitals still using pagers in 2026? The answer might surprise you. In this video, we break down why hospital pagers are still used today, the security risks of pagers, and whether they meet HIPAA compliance standards. While pagers have long been trusted for their reliability, many healthcare organizations are now re-evaluating their role in modern clinical communication. We also explore why pagers are considered insecure, including the lack of encryption, no read receipts, and limited communication capabilities, all of which can impact patient care and coordination.

Best Call Routing Software for On-Call Teams in 2026 (After-Hours & Emergency Routing)

Most teams don’t go looking for “call routing software.” They’re trying to solve something more immediate: calls coming in after hours, no clear owner, and something important getting missed.

Do Hospitals Still Use Pagers in 2026? Pager Replacements

Remember the small rectangular devices that could receive short messages? Some may think of it as an outdated device that people have long forgotten about, while others still use it to this day. Pagers, although becoming less and less relevant, are still used by many large hospitals that deem them an essential tool for their day-to-day critical communication. But in 2026, are there pager replacements in the market?

What Is Mean Time to Resolve (MTTR)? (And How to Improve It)

Every minute a network incident goes unresolved costs your company money. Lost productivity, missed SLAs, degraded user experience, and, in other cases, direct revenue loss. For IT teams and network admins, the pressure to resolve incidents fast isn't just operational, it's existential.

What does using AI for post-mortems actually mean?

Everyone is using AI to help with post-mortems now. The pitch is obvious: post-mortems are time-consuming, the blank page is brutal, and AI is very good at producing structured, confident-sounding documents quickly. We're not here to push back on that. We've built AI into our own post-mortem experience, pulling your Slack thread, timeline, PRs, and custom fields together and giving your team a meaningful starting point in seconds. We think that's genuinely valuable, and the teams using it agree.

How it feels to run an incident with AI SRE

We've been building the broader incident.io platform for several years now, and one thing we've learned is that UX matters more here than almost anywhere else. When an incident fires, there's no room for poorly designed interfaces or fumbling through features you haven't touched in a while. The product has to be ergonomic: easy to pick up, easy to navigate, with the right things at your fingertips at exactly the right moment. We've put a lot of effort into this over the last 5 years.

From Static Response to Dynamically Adaptive Resilience

Organizations face an overwhelming mix of digital disruptions: service outages, security incidents, infrastructure failures, all happening faster and with greater complexity than ever before. At the same time, expectations have changed. It’s no longer enough to detect issues quickly or simply notify the right people. The real challenge is what happens next. How do you move from signal to action fast enough, coordinated enough, and with the right decisions at every step?