
The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

8 Video Workflows That Optimize IT Operations

It wasn't that long ago that Agile revolutionized IT workflow, introducing a feedback-forward process that ensured each project task was perfected and approved before the team moved on to the next. To execute a task with high precision, an assigned team needs a reliable arsenal of tools, including video. Project managers also need updated tool stacks to lead complex projects to completion.

Turning team knowledge into Alert Routing rules

Over time, on-call teams build up a quiet layer of knowledge about their systems. Someone learns that a specific error code always means phone calls are failing. Someone else figures out that a particular background job fires a warning every night and has never once needed attention. That knowledge shapes how your team responds to incidents every day. But when it only lives in people’s heads, your response depends entirely on the right person being available at the right time.
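As a rough illustration of what writing that knowledge down could look like, here is a minimal sketch in Python. It is not any vendor's actual routing API: the `Alert` and `Rule` types, the `route` function, the error code `E503`, and the source name `nightly-backup` are all hypothetical stand-ins for the two examples above.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Alert:
    source: str                       # system that emitted the alert
    message: str                      # human-readable alert text
    error_code: Optional[str] = None  # optional machine-readable code


@dataclass
class Rule:
    name: str
    matches: Callable[[Alert], bool]  # predicate capturing one piece of team knowledge
    action: str                       # "page", "notify", or "suppress"


# Hypothetical rules; the code and source name are assumptions for illustration.
RULES: List[Rule] = [
    # "This error code always means phone calls are failing" -> wake someone up.
    Rule("telephony failure", lambda a: a.error_code == "E503", "page"),
    # "This nightly job warns every night and never needs attention" -> drop it.
    Rule(
        "noisy nightly job",
        lambda a: a.source == "nightly-backup" and "warning" in a.message.lower(),
        "suppress",
    ),
]


def route(alert: Alert, rules: List[Rule] = RULES, default: str = "notify") -> str:
    """Return the action of the first matching rule, or the default action."""
    for rule in rules:
        if rule.matches(alert):
            return rule.action
    return default
```

The first matching rule wins, so more specific rules should sit earlier in the list. Anything that matches no rule falls back to a plain notification rather than being silently dropped.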

Do Veterinarians Go On Call? Reinventing OnCall Management for Veterinary Clinics

Veterinary clinics typically operate during standard 9–5 business hours. But emergencies don’t follow a schedule. The puppy you just brought home might decide that the rubber duck your toddler dropped on the floor looks like the perfect snack. Or your dog might get into a box of Valentine’s Day desserts you left on the counter. Suddenly, what seemed like an ordinary evening turns into a frantic search for help.

The Hidden Cost of AI Productivity: When Efficiency Turns Into "Brain Fry"

A new HBR study reveals that the race to build and manage AI agents may be pushing knowledge workers toward a new form of cognitive overload. If you spend any time on LinkedIn these days, you’ve probably seen the same type of post over and over. Someone proudly announces they built an AI agent that now writes their emails, analyzes data, drafts presentations, and maybe even ships code.

The Path to Autonomous Operations: PagerDuty Spring 26 Release

Shipping velocity has never been faster, but reliability can’t be the trade-off. For engineering leaders, deploying AI for operations is no longer optional. The question is whether you’ll lead the transformation or fall behind. The hard truth? Organizations can’t keep relying on humans as the first line of defense, not at the current pace of shipping. It’s simply not scalable.

On-call compensation for IT engineers in 2026

Imagine it’s 2 AM and a critical system flatlines without warning. A bleary-eyed on-call engineer scrambles to restore service, shielding customers from a major outage that could torpedo your next Service Level Objective (SLO) review. Yet when daylight returns, debates over fair on-call compensation start all over again: What’s “just” pay for sleepless nights, unpredictable pings, and rapid-fire incident responses?

Do Veterinarians Go Oncall? And How Does It Work?

Veterinary clinics typically operate during standard 9–5 business hours. But emergencies don’t follow a schedule. Having the option to reach an on-call veterinarian through a dedicated after-hours emergency line provides peace of mind not only for pet owners, but, believe it or not, for veterinarians as well. So how does ONCALL work for veterinary clinics? Find out more through our Doggy Explain video.

How to set up Alert Routing rules effectively

Different incidents need different levels of attention. Some need a phone call at 3 AM and others can wait until morning. Alert Routing rules are what let you act on that understanding without doing it manually every time. An effective routing setup does three things: Getting all three of these working is what makes a routing setup useful.

Global Industrial Leader Coordinates Severity 1 Incidents with Clarity and Speed

“The first 15 minutes of a Sev-1 incident often determine the next 15 hours.” For a multi-billion dollar global industrial leader, managing Severity 1 incidents across a complex, distributed infrastructure is a high-stakes operation. When systems go down, the impact is felt instantly across production lines and global logistics.