Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Rootly On-Call: On-Call Shadowing Feature

Shadowing experienced responders is one of the most effective ways for folks who are new to on-call to gain the confidence and knowledge to handle incidents independently. Traditionally, shadow rotations are cumbersome to set up, involving duplicating and editing an existing schedule. For Rootly On-Call users, setting up shadow rotations couldn’t be easier with our new native Shadowing feature. Here are a few highlights.

NYSE uses AIOps to identify problems faster and focus on innovation

The New York Stock Exchange relies on AIOps to extract crucial incident insights, allowing IT teams to focus on innovation instead of manually investigating alert data. Chuck Adkins, CIO, shares how an AIOps tool helps the NYSE save time and resolve problems instead of searching through alerts to find them.

Enable ilert Intelligent Alert Grouping

Intelligent alert grouping is a new feature of ilert. It is powered by ilert AI and designed to prevent alert fatigue. The feature combines alerts into groups based on their content. Our video explains how to enable alert grouping for your alert source and how to adjust the accuracy of the grouping. The feature is a part of the new powerful ilert add-on and is currently available at no extra cost during the Beta phase.

Network topology: Definition and role in observability

Network topology describes how a network‘s nodes, connections, and devices physically arrange and interconnect, as well as how they communicate. The arrangement or configuration of a network’s components plays a crucial role in ensuring smooth ITOps with minimum downtime. Any issues in the network can disrupt operations, leading to potentially dire consequences. To prevent this, you need to understand your network functionality and structure.

Demo Roundups! Scale Support Teams with PagerDuty's CX Operations

PagerDuty’s Solutions Consulting Team Lead Michael Aravopoulos presents an exclusive live demo showcasing PagerDuty's Customer Service Operations capabilities. Identify and address issues before they affect your customers Automate incident discovery and response to deliver streamlined digital experiences Facilitate communication and coordination between customer service and technical team.

Effective Slack on-call protocols for engineers

Talks about being on call are usually met with complaints. Here's how to alter the narrative and develop a stronger, more compassionate process. A few years ago, I took oversight of a significant portion of our infrastructure. It was a complex undertaking that, if not managed and regulated properly, could have resulted in major disruptions and economic consequences over a large area.

Steps to AIOps maturity: Establish actionable incidents

Lack of communication between IT operations and ITSM teams results in data silos. And data silos make it challenging, if not impossible, to solve problems efficiently. One-third of ITOps professionals say that gathering business context is the biggest challenge to effective incident response and management, according to EMA Research.

Evaluating Opsgenie Alternatives in 2024

In today’s digital age, customer expectations are at an all-time high, with demands for instant support, flawless user experiences, and constant service availability. This environment of heightened expectations pushes organizations to innovate and streamline their operations continuously. Ensuring seamless service delivery hinges on the ability to detect and resolve issues swiftly, whether they are server crashes, software bugs, or unexpected outages.