
Events, Alerts, and Incidents: What's The Difference? How Do They Relate?

Effectively managing events and alerts is essential for preventing or quickly resolving incidents, whether it’s a sudden service outage or an ongoing cyberattack. The three terms (events, alerts, and incidents) are different but closely related. Ensuring the reliability, performance, and efficiency of IT systems is both the heart of operational excellence and an important strategic objective for digital organizations.

Building a Simple Synthetic Monitor With OpenTelemetry

Using server-side telemetry to understand what’s going on inside your system is incredibly valuable, but what about the responsiveness the user actually sees? In this post, I’ll cover what synthetic monitoring is and show an example of how you can create a simple monitor using OpenTelemetry, .NET, and an Azure function. If you only want to see how it’s built, skip ahead to building a synthetic monitor.

Find and fix CI build errors with AI

Software teams rely on CI/CD pipelines to build, test, and deploy code quickly. But when a build fails, it can disrupt the entire workflow. Digging through logs, chasing down errors, and switching between dashboards takes time you don’t want to waste. In this tutorial, you’ll learn how to use your AI coding assistant — powered by structured data from your CI system — to diagnose and fix build failures faster.

How Console Connect is empowering data centre partners with enhanced connectivity

The digital landscape is evolving at a rapid pace, and businesses are constantly seeking agile, high-performance connectivity solutions to stay ahead. Recognising this, data centres are shifting beyond their traditional role of offering just space and power. Today, they must deliver real-time access to clouds, other data centres, and applications.

How does website monitoring even work?

Every website manager knows that feeling when you look at your inbox only to find a customer notifying you that a core page of your site is down. The worst part is that you don’t know how long that page has been down. If you’ve yet to experience that, count your blessings, or better yet, opt for a website monitoring solution before it happens to you. With website monitoring, you can confirm that every page on your site is up and running, and find out quickly when one is not.

Extending the Capabilities of DX Unified Infrastructure Management: Release 23.4 CU4

Release 23.4 Cumulative Update 4 (CU4) for DX Unified Infrastructure Management (DX UIM) adds significant improvements to the product’s security stance and extends technical currency to support modern infrastructures. The release builds on the proven track record of DX UIM to deliver enterprise-ready capabilities and monitoring coverage, while meeting the highest standards for security, scalability, and performance.

Integrate PagerDuty with ServiceNow to Improve Major Incident Management

Downtime isn’t just an inconvenience—it’s a revenue killer that can cost millions and shatter customer trust. While critical incidents pile up in ticketing queues, support teams drown in manual triage, racing against time to spot fires before they become infernos. Enter the PagerDuty Operations Cloud + ServiceNow integration.

How To Pick The Correct Metrics For Your Monitoring

This is a guest blog post by Adam Sweet from Icinga partner Transitiv Technologies. Since this is a longer post, we added a tl;dr at the end. For many, host and application monitoring is an afterthought at the end of a project. Some people don’t think about monitoring at all until a few failures go unnoticed and a customer or end user calls to ask why something isn’t working.

Prometheus Distributed Tracing: An Easy-to-Follow Guide for Engineers

When your microservices architecture starts growing, tracking requests as they bounce between services becomes a real headache. You know the feeling—a user reports a slow checkout process, and you're left wondering which of your twenty services is the bottleneck. That's where distributed tracing with Prometheus comes in.

What is API Monitoring and How to Build API Metrics Dashboards

In today's connected world, APIs are the backbone of modern applications. Whether you're working on a microservices architecture, a mobile app, or a SaaS platform, APIs are what keep everything talking to each other. But how do you know if your APIs are healthy, performing well, and delivering what your users need? That's where API monitoring comes in. Let's break down what API monitoring is, why it matters, and how you can build effective API metrics dashboards to keep your systems running smoothly.