Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

7 Best Practices to Improve Digital Employee Experience in Modern IT Environments

Digital employee experience isn’t just a nice to have anymore. In hybrid, SaaS heavy IT environments Digital Employee Experience (DEX) is where productivity can live or die. Employees don’t care whether the culprit is Wi‑Fi connectivity, CPU/RAM load, poor battery life, or a misbehaving cloud app. They just know work got harder.

Auvik Aurora and the Future of AI in IT Operations

We built something called Auvik Aurora, and before you scroll any further, I can already hear your thoughts. “Wait a second, Anto. Is this going to be another blog post giving me the hard sell on using AI?” Fair enough, I don’t think anyone would blame you, especially when we’re seeing AI adoption across nearly every industry, tool, hobby, workflow, or even . The blank is intentional, AI is everywhere, and chances are that you already know that it matters.

Fixing JavaScript observability, one library at a time

Over the past few weeks, we have been driving a cross-ecosystem effort to replace the “monkey-patching” that powers all JavaScript APM tools today with something built into the runtime. Here is why, how, and where it stands. This applies to server-side JavaScript only (Node.js, Bun, Deno, Cloudflare Workers). Browsers do not have diagnostics_channel and lack the async context propagation primitives needed to polyfill it.

ActiveMQ Monitoring & Alerting Setup: The Complete 2026 Guide

Most ActiveMQ outages are not sudden failures. They are visible in the metrics for minutes, sometimes hours, before they become incidents. A memory usage graph climbing past 60%. A queue depth that isn't draining. An enqueue time that doubled after a deployment. A consumer count that dropped from 3 to 1 at 2 AM.

Observability and Security for the AI Era

Datadog has always been driven by a broader vision of helping teams understand and operate complex systems. In this session, you’ll hear from Michael Whetten, Product SVP, and Abrar Hussain, Senior Director, Product Management, as they share the latest updates across the Datadog product suite and discuss how that vision continues to shape the platform’s evolution and support the next generation of AI-driven applications.

Data Sovereignty: How to Keep All of Your Services in Europe (AppSignal + Hatchbox)

Over the last decade, a great deal of data privacy regulations have been passed in the European Union. Like it or not, measures like GDPR, the Digital Services Act, and the upcoming Artificial Intelligence Act are exerting increasing influence across industries over how and especially where the data of European customers is stored. In this article, we will explore the ways to keep the simplicity of a Platform as a Service (PaaS) while utilizing only European providers.

Faster OpenTelemetry Migrations from Splunk to SecOps with Bindplane

Many security teams are looking to move off Splunk, whether to reduce licensing costs, consolidate their SIEM, or take advantage of Google SecOps' built-in threat intelligence and YARA-L detection capabilities. But migrations aren’t easy, and no one wants to run blind while they evaluate and move to a new platform. With OpenTelemetry and Bindplane, you can easily make the switch to SecOps without impacting your existing stack.

How one partnership powers search for over 2 million WP Engine users

How do you make search faster, smarter, and more scalable? During our recent webinar, I sat down with Luke Patterson, senior product manager at WP Engine, and Delphin Barankanira, independent software vendor partner engineering lead and data & AI specialist at Google Cloud, to answer that question. We dug into the mechanics behind WP Engine’s ability to deliver near-instant updates to over 2 million users.

Eliminate noisy log lines with Adaptive Logs drop rules

Most platform and observability teams have logs they know are noise. These could be throwaway health check logs, forgotten DEBUG logs, or verbose INFO logs from little used services that only serve to inflate your bill. Regardless of what they contain and why they're there in the first place, the hard part is getting rid of them. Centralized teams want to easily and quickly prevent these logs from being ingested, without having to work with toilsome infrastructure change management to do so.