Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

What's New in InfluxDB 3 Explorer 1.7: Table Management, Data Import, Transforms, and More

InfluxDB 3 Explorer 1.7 is a step forward for anyone who wants to manage their time series data without constantly switching between the UI and a terminal. This release adds table-level schema management, the ability to import data from other InfluxDB instances, and a new Transform Data section to reshape your data, all within the Explorer UI.

Ephemeral Leaks and Automated BGP Route Leak Detection

Many BGP route leaks reported by automated detection systems are actually brief, low-impact artifacts of normal BGP convergence. Doug Madory examines examples from Cloudflare Radar, Routeviews, and Jared Mauch’s long-running leak detector to show how these “ephemeral leaks” arise, why they usually don’t disrupt traffic, and why they still matter for routing security.

N+1 Detection in AppSignal's OpenTelemetry Trace Timeline

N+1 query problems are one of the most common, and quietly damaging, performance issues in production applications. One extra query per record feels harmless in development. At scale, it becomes the reason your response times degrade and your database buckles under load. Today, AppSignal adds N+1 detection to its OpenTelemetry support. When we identify the pattern in a trace, we collapse the repetitive spans directly in the timeline, making the problem immediately visible in the trace itself.

Grave improvements: Native crash postmortems via Android tombstones

Native crashes on Android have always been harder to debug than they should be. The platform has its own crash reporter (debuggerd) that captures the crashing thread, every other running thread, register state, and memory maps into a file called a tombstone. Tombstones have been a part of Android for a long time; in fact, they’ve been there in one form or another since Android's first commit.

What Is an AI SRE? And Why Do They Need Live Runtime Evidence?

AI SREs are autonomous systems that handle incident triage, root cause analysis, and remediation by correlating logs, metrics, traces, and code signals. However, as they rely on pre-configured telemetry, the critical execution details of a specific failure, such as variable state and code paths, can often be missed. As a result, they either force users into manual redeploy loops or make inferences from partial data, diagnosing issues using probability rather than proof.

Site24x7 MSP: The all-in-one platform for managed service providers

Managing dozens of client environments you don't own, behind firewalls you can't see through, while keeping SLAs intact is the essential MSP predicament. Site24x7 MSP is a cloud-native platform built to solve it. From a single multi-tenant console, monitor servers, networks, applications, and cloud workloads across AWS, Azure, and GCP with agent-based telemetry that catches issues before they escalate. True data isolation and RBAC keep client accounts secure. White-labeled portals, domains, and agents make it look like your platform. AI-powered self-healing workflows resolve incidents automatically.

What Are DNS Records? DNS explained in simple terms | A complete guide

Learn how DNS (Domain Name System) works and why it's called the internet's phone book. This video breaks down the entire DNS resolution process, from cache checks to root servers, and covers every essential DNS record type, including A, AAAA, CNAME, MX, NS, SOA, TXT, PTR, SRV, and CAA records.

Best Digital Experience Monitoring Solutions: 2026 Buyer's Guide

A website that loads slowly or an application that freezes mid-transaction tells users something about an organization, whether intended or not. Digital experience monitoring exists to catch these moments before they accumulate into lost customers and frustrated employees. We’ll show you how DEM works, the leading platforms available, and how to select the right solution for specific organizational needs.

Top 5 Zabbix Dashboarding Tools Compared

Zabbix collects a huge amount of operational data—metrics, alerts, host status, and performance trends. But turning that data into dashboards people actually use is a different challenge. Most teams start with the built-in dashboards. Then the requests start coming: At that point, basic dashboards aren’t enough. Teams start looking for ways to augment Zabbix visualization with tools that improve usability, sharing, and flexibility.

Icinga as Open-Source MSP Monitoring Software: Multi-Tenant Monitoring for IT Service Providers

If you run a managed service provider, your RMM software is the backbone of daily operations. Remote management, patch cycles, ticketing workflows – it handles the essentials. But if you’re monitoring more than a few dozen client environments, you’ve likely noticed that monitoring and management are not the same thing. And that difference matters more the larger you grow. This post is not about replacing your RMM.