Operations | Monitoring | ITSM | DevOps | Cloud

Configuration drift in enterprise networks: Causes, impact, and management

Network admins want all devices with the same role to behave the same way. But in real environments, that consistency rarely lasts. Imagine two core switches in the same data center. They serve the same function and run the same OS version. One handles traffic without issue, while the other drops packets during peak hours. Logs show nothing obvious. Routing looks correct. The team spends hours checking links, hardware, and traffic paths.

Which Bugs AI Agents Fix Better With Traffic

In the first experiment, I wanted a baseline: if an AI coding agent gets the same production signal a human would get, can it fix bugs in a codebase it has never seen? Yes, but only when I gave it better context. With only an alert, the agent passed 51% of the runtime tests. When I added captured traffic, the actual request and response for the failing call, it climbed to 77%. This post is the second pass.

Connecting Ticketing Systems to Microsoft SCOM

Microsoft SCOM (System Center Operations Manager) remains a widely used enterprise monitoring platform due to its deep integration with Windows, hybrid-cloud support, and extensible management packs. However, the value of SCOM is fully realized only when its alerts seamlessly flow into ITSM or ticketing systems. This ensures incidents are created, routed, and resolved efficiently.
Sponsored Post

Avantra 26: A Breath of Fresh Multi-Tenant AIR

There's a crackle and spark in the air at Avantra lately, and I'm so pleased to be writing this bit on what we've accomplished with the Avantra 26 release. Automated root cause analysis, multi-tenant management support for Cloud ALM, enhanced security operations and financial operations monitoring BTP - it's all there, and more. It's an exciting and innovative release for Avantra!

Why Observability Isn't Enough for AI Coding Agents

Observability platforms collect pre-instrumented logs, metrics, and distributed traces to monitor production systems and surface failures to human engineers. The adoption of AI into engineering has led observability providers to offer those same signals to agents. This is often packaged as AI observability, but the signals themselves were designed around a human investigation loop. AI coding agents work faster, consume data differently, and need feedback as they work rather than after deployment.

Building a resilient workspace with an integrated security framework

Since 2020, the modern workspace has fundamentally changed, where employees now operate across a mix of office, hybrid and remote locations. Critical systems are now distributed between data centres and public cloud platforms, and most corporate data lives in the cloud. This shift has expanded the attack surface for many businesses.

What is Network Monitoring? A Guide for IT Teams

Over 90% of mid-sized and large companies estimate that a single hour of downtime now costs more than $300,000. The clock starts the moment something breaks, whether anyone has noticed it or not. And most outages don't start with alarms. They begin with a small issue inside the network: an overloaded switch, a saturated link, or an unstable interface. Left unnoticed, those small issues grow into user complaints, stalled work, lost revenue, and damaged customer trust.

7 Secure Medical Messaging Apps Private Practices Trust in 2026

For private medical practices in 2026, secure and efficient communication is non-negotiable. Standard consumer messaging apps like iMessage and WhatsApp are not compliant with privacy regulations and create significant risks for both patients and providers. Adopting a dedicated, HIPAA-secure messaging solution is essential for protecting patient data and streamlining clinical workflows.

Instrumenting AI Agents for the Agent Timeline: A Practical OpenTelemetry Guide

AI agents are nondeterministic, multi-step, and opaque. When one fails in production, "the model said something weird" is the cheapest, most useless line in your incident postmortem. To debug agents the way they actually run, you need telemetry that captures all of it, in order, with enough context to reconstruct what happened. The OpenTelemetry GenAI Semantic Conventions give you a vendor-neutral way to do exactly that.

Your AI isn't underperforming. Your data foundation is.

New research reveals why Australian businesses are entering the new financial year with bigger AI budgets and the same unsolved problem. One in three Australian businesses exceeded their AI budget last year. Yet, half of them plan to increase AI spending again this year. Yet the behaviour that caused those budget overruns remains largely unaddressed.