
The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

Paving the way for a new era: Mezmo's Active Telemetry

The world of software development has fundamentally changed. We've moved from monthly releases to continuous delivery measured in minutes, and the rise of AI means velocity is no longer just a goal; it's a requirement for survival. But this relentless speed has exposed a critical flaw in how we approach observability. The industry relies on a "store first, ask questions later" model where you collect every log, metric, and trace, and then hope to find the root cause when something breaks.

What's New in InfluxDB 3.5: Explorer Dashboards, Cache Querying, and Expanded Control

InfluxDB 3.5 is now available for both Core and Enterprise, along with updates to the new Explorer UI that make it easier to save, organize, and query your data. This release highlights the biggest updates since our 3.4 release, including Explorer Dashboards in beta, new cache querying capabilities, and stronger operational tools for managing clusters. InfluxDB 3 Core is free and open source, optimized for recent data, and licensed under the MIT and Apache 2.0 licenses.

Ship features faster and safer with Datadog Feature Flags

Releasing new features is one of the highest-stakes moments in the software delivery life cycle. Even with CI/CD pipelines in place, plenty of things can still go wrong when a feature goes live for actual users. Most feature flagging tools operate in isolation from important observability tooling, forcing engineers to monitor changes across multiple disconnected systems to fully understand their impact. This slows down development and increases the chance of missing critical issues.

How to boost observability ROI with continuous profiling and Grafana Drilldown

For the longest time, observability was centered around logs, metrics, and traces, but the growth of more complex systems has made continuous profiling another essential part of keeping them healthy. It exposes resource usage and latency down to the code level, giving teams the detail they need to improve performance.

Node.js Monitoring in Serverless Environments - A Complete Guide

Serverless computing with Node.js is transforming how applications are built and scaled by removing the need to manage servers. However, serverless functions run for short durations and scale dynamically, making traditional monitoring ineffective. Effective monitoring is essential to track performance, detect errors, optimize cold starts, and control costs.

Docker Daemon Logs: How to Find, Read, and Use Them

Sometimes Docker behaves in ways that catch you off guard: containers don’t start as expected, image pulls stall, or networking takes longer than usual to respond. In those moments, the Docker daemon logs are your best reference point. These logs capture exactly what the Docker engine is doing at any given time. They give you a running account of system state, performance signals, and events that help you understand what’s happening beneath the surface.

Build on Your Microsoft SCOM Foundation

Enterprises that rely on Microsoft System Center Operations Manager (SCOM) as their monitoring backbone often share an everyday reality: the bigger the environment, the bigger the challenges. Noisy alert storms can bury critical issues. Management Packs (MPs) require ongoing care and expertise to deliver accurate insights. And without consistent reporting, teams risk slipping into reactive fire-fighting instead of strategic monitoring.

Telemetry Now Teaser: "Turning Network Telemetry Into Financial Insight"

Network operators prioritize cost, performance, security, and reliability as their core needs. But how do they get the economic data to make tradeoffs when one of these needs suffers? Tune in to the latest Telemetry Now with special guest Lauren Basile to learn how Kentik Traffic Costs is providing data-backed answers to these questions.

How We Built VictoriaLogs Cluster: A CTO's

Go behind the scenes with the VictoriaMetrics team! In this special talk, Marc Sherwood is joined by our CTO, Alexander Marshalov, to explore our powerful, open-source logging solution, VictoriaLogs. This isn't just a feature showcase. This is a deep dive into the engineering mindset that drives our development. Alexander shares firsthand insights into why we built VictoriaLogs Cluster, the technical challenges of creating a distributed system for logs, and the core principles of simplicity and efficiency that guide our architecture.