Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

ITSM vs. ITOM: What are the key differences?

IT service management (ITSM) and IT operations management (ITOM) both have the mandate to ensure your organization’s IT systems and infrastructure run smoothly and efficiently. These two frameworks are essential for any modern IT environment, but their roles are often confused or misunderstood. Simply put, ITSM focuses on the user-facing side of IT, streamlining services and aligning IT processes with business objectives.

Zenduty is joining Xurrent!

We launched Zenduty just two months before the onset of the COVID-19 pandemic with a mission to redefine incident management by providing a robust, reliable, and intelligent platform for IT operations teams, DevOps, and SREs. At a time when businesses were rapidly shifting to remote operations and dealing with unprecedented challenges, the need for a resilient and intelligent incident management platform became more critical than ever.

Best incident management tools in 2025 [45 analyzed]

PagerDuty, Splunk, ServiceNow — with dozens of incident management tools on the market, how do you know which one to choose? Here's the reality — downtime costs organizations an average of $9,000 per minute. That's why companies are increasingly investing in incident management tools to reduce disruption and improve their incident response. But with the market evolving rapidly and new players emerging constantly, selecting the right tool has become more challenging than ever.

Shorten your MTTR with Checkly Traces

We all know that Checkly is a ‘secret weapon’ for engineering teams who want to shorten their mean time to detection (MTTD). With Checkly, you can know within minutes if your service is unavailable for users, or acting unexpectedly. In this article we’ll talk about how Checkly traces can help you expand on the benefits of Checkly, adding insights that will help you diagnose root causes, and further reduce your mean time to resolution (MTTR) for outages and other incidents.

Weaving AI into SIGNL4

Over the past two years, artificial intelligence (AI) has experienced remarkable growth, significantly influencing various sectors and daily life. In 2023, the release of advanced large language models (LLMs), such as OpenAI’s GPT-4 and Google DeepMind’s Gemini, marked a pivotal shift by enabling AI systems to process and generate diverse data types, including text, images, and audio.

PagerDuty Operations Cloud Spring 25 Release: Reimagining Operations in the Age of AI and Automation

Operational excellence isn’t just a goal—it’s critical for survival for all companies. And, when powered by AI and automation, it’s a strategic competitive differentiator. With over a decade of AI and ML experience in our platform, PagerDuty pioneered the Incident Response space. And now, PagerDuty is redefining what modern operations can look like in the era of AI and automation.

Microsoft Entra ID Outage: How Vantage DX Detected the Issue Before Microsoft Acknowledges the Issue

On February 25, 2025, at 11:32 AM EST, Martello’s Vantage DX monitoring began alerting on an issue affecting Microsoft Entra ID (Azure AD SSO). While Microsoft had not yet acknowledged the incident, online reddit forums had noted the issue and our Vantage DX proactive monitoring detected disruptions impacting authentication across multiple workloads. See here the critical warning for Exchange in Vantage DX Monitoring. Here is the critical warning for OneDrive and SharePoint in Vantage DX.

February 2025 Box Outage: Timeline and Post-Mortem

Box.com is a cloud-based content management and file-sharing platform designed for the enterprise and used by nearly 100,000 companies around the world. When a Box outage strikes, businesses can experience costly disruptions. On February 19, 2025, a disruption in core Box services including uploads, downloads, and the All Files page, affected thousands who depend on the cloud storage and collaboration platform.