The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

I let Claude investigate a production incident with Honeybadger's MCP server

In this demo, Kevin shows how you can use Honeybadger's MCP server with Claude to investigate a production incident — going from a natural language prompt to a complete incident dashboard in minutes. Honeybadger is an application health monitoring platform that helps developers catch errors, track performance, and stay on top of incidents. The MCP server lets AI assistants like Claude query your Honeybadger data directly, so you can investigate issues conversationally without digging through dashboards manually.

Monitoring and Optimizing a Hybrid Cloud Environment | WhatsUp Gold

This webinar covers monitoring and optimizing a hybrid cloud environment. Downtime is an expensive inconvenience, yet many IT teams still face monitoring blackouts due to rigid licensing models and outdated failover strategies. In this session, we’ll introduce a smarter approach: High Availability by Design. Whether you're scaling operations or modernizing infrastructure, this session will equip you with the tools and insights to build a resilient, future-ready monitoring strategy.

Reinventing the Incident Responder's Day: Empowering Tier 2 SOC Analysts with Splunk's Agentic SOC Platform

The Tier 2 SOC analyst, or incident responder (often hailed as the "Sherlock Holmes of the network"), faces an increasingly complex and relentless digital landscape. Incident responders are charged with the critical task of identifying, analyzing, and mitigating security threats, yet they are overwhelmed by alerts and held back by fragmented, manual tooling and inefficient workflows.

Observability Self-Hosted 2026.1 - Additional Cloud Support

SolarWinds Evangelist Chrystal Taylor demonstrates the new cloud entity support features in Observability Self-Hosted version 2026.1. The update adds monitoring capabilities for MySQL and PostgreSQL databases on Google Cloud Platform, GCP load balancers, Azure Functions, AWS Elastic Kubernetes Service, and AWS Lambda functions. She provides a guided walkthrough of the dashboard interface, showing how users can monitor metrics such as database performance, network traffic, latency, function execution counts, system usage, and costs across cloud platforms.

The Evolution of Digital Employee Experience (DEX) | How IT Is Transforming the Workplace

Digital Employee Experience (DEX) is transforming how IT teams support employees, improve productivity, and drive business outcomes. In this video, we explore the evolution of DEX—from traditional reactive IT support to proactive, experience-driven operations that empower both employees and organizations.

The Grafana Cloud identity blueprint: balancing security and scale

If you've ever rolled out Grafana Cloud to a growing engineering organization, this pattern may sound familiar: Everything feels simple at first. You invite a few teammates, give them access, and dashboards start appearing. Then the team grows. Then the number of stacks grows. Over time, a model that once felt fast and empowering starts to feel risky, difficult to understand, and even harder to undo. This post is about avoiding that moment.

Measure and improve mobile app startup performance with Datadog RUM

Mobile app users form opinions quickly. A slow or inconsistent startup experience can frustrate them before they reach the first screen, increasing the likelihood that they abandon the app or fail to complete key actions such as signing up or making a purchase. However, app teams often lack reliable signals that explain why startup performance varies, making it difficult to improve the user experience.