
Introducing the Service Map in Datadog

When your pager goes off at 3:00 a.m. and you need to begin your investigation, where do you start? Before you can attack the problem, you need to know the lay of the land: What else is potentially affected by this failing service? What are its dependencies? Where are the probable root causes? With the new Datadog Service Map, you can visualize the topology of your application to answer these questions and more.
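The two questions above (what does a failing service depend on, and what else might it affect) amount to walking a dependency graph in opposite directions. The following is a minimal sketch of that idea, not the Service Map's implementation or API; the service names and the `DEPENDS_ON` table are hypothetical.

```python
from collections import deque

# Hypothetical topology: each service maps to the services it calls directly.
DEPENDS_ON = {
    "web": ["auth", "checkout"],
    "checkout": ["payments", "inventory"],
    "auth": ["user-db"],
    "payments": ["payments-db"],
    "inventory": [],
    "user-db": [],
    "payments-db": [],
}


def reachable(graph, start):
    """Return every node reachable from start (excluding start), breadth-first."""
    seen, queue = set(), deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, []):
            if nxt not in seen and nxt != start:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def invert(graph):
    """Flip edge direction so each service maps to its direct callers."""
    callers = {name: [] for name in graph}
    for caller, callees in graph.items():
        for callee in callees:
            callers.setdefault(callee, []).append(caller)
    return callers


def dependencies(service):
    """Services the given service relies on, directly or transitively."""
    return reachable(DEPENDS_ON, service)


def affected_by(service):
    """Services that call the given service, directly or transitively."""
    return reachable(invert(DEPENDS_ON), service)
```

With this toy topology, `dependencies("checkout")` lists where a root cause might sit, while `affected_by("payments-db")` lists what a database failure could ripple into.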

Monitoring multi-cloud container storage with Portworx and Datadog

Portworx provides solutions for Kubernetes storage as well as other leading container schedulers, dramatically reducing storage, compute, and infrastructure costs for running mission-critical, multi-cloud applications with zero downtime or data loss. With Portworx, you can manage any database or stateful service on any infrastructure using any container scheduler. Portworx is trusted by many of the world’s most sophisticated IT organizations including Comcast, GE, Lufthansa Systems, the U.S.

Collecting metrics with IIS monitoring tools

In this post, we’ll show you how to use built-in IIS monitoring tools to access and graph performance counters, configure logging in IIS, and query your logs with Microsoft’s Log Parser Studio. We’ll also explain how to use a diagnostic tool to investigate memory leaks and high CPU utilization in your application pools and worker processes.

Key IIS metrics to monitor

Microsoft’s Internet Information Services (IIS) is a web server that has traditionally come bundled with Windows (e.g., IIS versions 5.0, 6.0, and beyond). IIS has numerous extensibility features. Swappable interfaces like ISAPI and FastCGI make it possible to use IIS with a variety of backend technologies, from micro-frameworks like Flask to runtimes like Node.js, along with technologies you’d expect to find within a Windows-based production environment (e.g., ASP.NET).

Node.js monitoring with Datadog APM and distributed tracing

Node.js is an asynchronous JavaScript runtime that is used to develop highly scalable network applications. To help provide more visibility into these dynamic environments, we’re pleased to announce that Datadog APM now officially supports monitoring Node.js applications, joining our existing support for Java, Ruby, Python, and Go.

Watchdog: Auto-detect performance anomalies without setting alerts

With anomaly detection, outlier detection, forecasting, and composite alerting, Datadog enables you to reliably alert the right people at the right time. But what happens when latency starts to increase, or error rates spike, in areas of your application where you haven’t set alerts? That’s what Watchdog is for.

Introducing APM Trace Search & Analytics with infinite cardinality

Distributed tracing provides a detailed view into application performance. Each trace shows you how an individual request was executed in your app: which user did what, which services were involved, how long it took, and whether the request executed successfully. Capturing that level of detail across hundreds or thousands of services provides a vast trove of information for troubleshooting and performance optimization, but it’s not always easy to find the exact trace events you need.
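To make the shape of that data concrete, here is a minimal sketch of a trace modeled as a set of spans, with a lookup by tag and a per-trace summary. This is an illustration only, not Datadog's data model or query API; the `Span` fields, the `user_id` tag, and the sample data are all assumptions.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Span:
    """One unit of work within a request (hypothetical, simplified fields)."""
    trace_id: str
    span_id: str
    parent_id: Optional[str]  # None marks the root span of a trace
    service: str
    duration_ms: float
    error: bool = False
    tags: dict = field(default_factory=dict)


def find_traces(spans, **wanted_tags):
    """Return sorted IDs of traces with a span matching every wanted tag."""
    return sorted({
        s.trace_id
        for s in spans
        if all(s.tags.get(k) == v for k, v in wanted_tags.items())
    })


def summarize(spans, trace_id):
    """Report which services were involved, how long it took, and success."""
    members = [s for s in spans if s.trace_id == trace_id]
    root = next(s for s in members if s.parent_id is None)
    return {
        "services": sorted({s.service for s in members}),
        "duration_ms": root.duration_ms,  # the root span covers the whole request
        "ok": not any(s.error for s in members),
    }


# Hypothetical sample data: two requests, one of which failed in "payments".
SPANS = [
    Span("t1", "a", None, "web", 120.0, tags={"user_id": "u-42"}),
    Span("t1", "b", "a", "payments", 80.0, error=True),
    Span("t2", "c", None, "web", 35.0, tags={"user_id": "u-7"}),
]
```

Filtering on a tag such as a user ID is the kind of high-cardinality lookup that becomes hard at scale: a linear scan like `find_traces` works for three spans but not for millions, which is why finding the exact trace events needs dedicated tooling.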