Monitoring

The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

Network traffic analysis for today's IT

When technologies that promise operational benefits evolve radically, they bring many challenges beyond a network administrator's typical scope. Organizations need effective strategies to manage the setbacks that can result from this complexity, as well as to address the evolution of cyberthreats. With network traffic analysis and an awareness of the challenges these technologies pose, network admins can ensure their network remains resilient.

Unlocking insights: Learn to deal with deadlocks and blocks with an SQL monitor

Deadlocks and blocks are two types of concurrency issues that can occur in an SQL Server environment. Understanding and addressing these issues is crucial for ensuring the performance and reliability of your SQL-based applications. First, let’s look at the concepts of locks, blocks, and deadlocks.

MAUI provider upgrades v2: Real User Monitoring + Crash Reporting

I’ve written previously about the process of adding Real User Monitoring capabilities to our MAUI provider. I’m excited to say that this work is now live, batteries and all, plus some more improvements since the last blog. To recap the state of cross-platform development in the .NET ecosystem, Xamarin is out of support as of May 1st! It is replaced by .NET MAUI (Multi-platform App UI), meaning developers need appropriate tools when they make the switch.

Raygun4Aspire: (Free) lightweight Crash Reporting running locally

.NET Aspire is a new type of project and set of NuGet packages that make it easier to coordinate the multiple moving parts of a cloud-native web application. Announced near the end of 2023, .NET Aspire is currently in Preview 6, so still a work in progress. We’ve just released Raygun4Aspire, our Crash Reporting client for Aspire applications.

Aggregate, correlate, and act on alerts faster with AIOps-powered Event Management

Maintaining service availability is a challenge in today’s complex cloud environments. When a critical incident arises, the underlying cause can be buried in a sea of alerts from interconnected services and applications. Central operations teams often face an overload of disparate alerts, causing confusion, delayed incident response, alert fatigue, and redundant resolution efforts. These issues can negatively impact revenue and customer experience, especially during an outage.

Track changes in your containerized infrastructure with Container Image Trends

Datadog’s Container Images view provides key insights into every container image used in your environment, helping you quickly detect and remediate security and performance problems that can affect multiple containers in your distributed system. In addition to having a snapshot of the performance of your container fleet, it’s also critical to understand large-scale trends in security posture and resource utilization over time.

Managing High Volume with OpenTelemetry

As your systems grow, so do the challenges of managing high-volume telemetry data. From horizontal scalability strategies to efficient data aggregation and storage techniques, we'll cover everything you need to know to keep pace with your expanding infrastructure. Don't let scalability constraints hinder your observability efforts—learn how OpenTelemetry can empower you to manage high volumes of telemetry data effectively and efficiently.