Operations | Monitoring | ITSM | DevOps | Cloud

Built for Engineers: Datadog's Vision for the Future

Datadog was built by engineers, for engineers. At, Datadog Co-founder & CEO Olivier Pomel opened the keynote with a clear message: observability, security and AI are converging. From infrastructure to AI Agents, the future of engineering requires one unified platform. Catch all product announcements to see what’s next in observability and security on our Youtube channel!

How to Reduce Application Downtime with APM?

According to a recent 2025 study, the average cost of downtime has inched as high as $9,000 per minute for large organizations. For higher-risk enterprises like finance and healthcare, downtime can eclipse $5 million an hour in certain scenarios. Whether you're part of a DevOps team, an SRE, a developer, or an engineering manager, minimizing application downtime should be a critical focus. One of the most effective ways to achieve this is through Application Performance Monitoring (APM).

What's Slowing Down Your App? Common Performance Issues APM Can Solve

Application performance is critical to user experience and business success. When an application starts slowing down, identifying the root cause isn’t always straightforward. For developers, DevOps engineers, and SREs, Application Performance Monitoring (APM) tools provide real-time visibility into how applications behave under load.

Troubleshooting: No data or monitor not created for .NET applications in Site24x7 APM Insight

Are your.NET applications not showing up in Site24x7 APM Insight? This step-by-step video will help you troubleshoot missing data or monitor issues for both IIS-hosted applications and Windows Services. In this video, you'll learn how to: Related links.

Fabric Interconnect: Connecting Servers with UCS Hardware

Every IT decision-maker faces a common challenge: balancing operational efficiency with cost control. While software solutions help streamline operations and drive efficiency, they can introduce redundancies into your system. These redundancies strengthen availability through backup systems but often complicate data management, leading to inconsistencies and potential outages. This is where hardware solutions like fabric interconnects prove invaluable.

Understanding APM and Distributed Tracing in the Observability Stack

To keep modern applications running smoothly, you need more than just basic monitoring. APM (Application Performance Monitoring) gives you a broad overview, tracking metrics like latency, errors, and system health. Distributed Tracing, on the other hand, shows the full journey of each request across services, helping you pinpoint the root cause of slowdowns or failures.

Stay Compliant: Meet Your Audit Needs with Datadog!

Datadog's internal compliance team has built audit workflows and control monitoring capabilities using the Datadog platform. We actively use these capabilities to scale our audit programs and comply with multiple compliance frameworks. This session will go into the details of how we addressed our compliance use-cases using the Datadog platform and how our customers can get started.

Introducing ZTB - Defining Zero Trust for Bring Your Own Cloud (BYOC)

Isn’t the "Bring Your Own Cloud" (BYOC) model the latest hot topic in the evolution of cloud-native architecture, especially for companies offering cloud-hosted platforms that must be deployed in the customer’s cloud for privacy, control, or compliance reasons? Over the past few weeks, we have been rigorously researching and discussing how to build a secure BYOC model.

How Cursor scaled infrastructure rapidly and reliably using Datadog

At Datadog, we use Cursor to empower our teams to build more quickly. And we know that building and troubleshooting with AI tools like Cursor is done best with the right observability data and context. Discover how Cursor was able to rapidly and reliably scale their infrastructure 100x using Datadog to meet the needs of a fast growing user base. And learn more about how we’re bring Datadog tools and context to your favorite AI IDEs and agents with our MCP Server and extensions.

Structured Logging in NextJS with OpenTelemetry

Traces tell you what happened and when. Logs tell you why. When something breaks, logs are often your first clue—and if they’re correlated with traces, they can cut debugging time down from hours to minutes. In this section, we’ll wire up end-to-end structured logging across both server and browser environments in your Next.js app, complete with trace correlation and SigNoz integration.

AI-Augmented Control Plane: Scaling IT Operations with Intelligent Automation

How do you enable a team of 100 engineers to effectively support 300+ critical applications across five hosting platforms? At Thomson Reuters, we turned to AI - not as a buzzword, but as a genuine force multiplier. Experience our journey of transforming traditional IT operations into an AI-augmented powerhouse, where Datadog, ServiceNow, and custom AI solutions work in harmony to create a next-generation control plane. We'll share real victories, honest challenges, and practical insights from our mission to build a more intelligent operational framework.

LLM Observability for Reliability and Stability: A Monitoring Strategy for Phone Communication

LLM APIs offer groundbreaking potential, but also present challenges such as response latency, hallucinations, and service instability. In Japan, where telephone communication remains crucial for business, these issues present significant barriers to the introduction of LLM-based applications. Despite being a relatively young startup, we have developed and deployed an LLM-based telephone service with over 40 million calls.

Monitoring your Nextjs application using OpenTelemetry

Nextjs is a production-ready React framework for building single-page web applications. It enables you to build fast and user-friendly static websites, as well as web applications using Reactjs. Using OpenTelemetry Nextjs libraries, you can set up end-to-end tracing for your Nextjs applications. Nextjs has its own monitoring feature, but it is only limited to measuring the metrics like core web vitals and real-time analytics of the application.

An Easy Guide to Getting Started with Elastic APM

Code in production will break. Maybe a request takes too long, maybe it fails quietly, or maybe it works fine one minute and falls over the next. Logs can help, sure—but they don’t always show the full picture, especially when performance issues are involved. Elastic APM gives you a clearer view. It traces what your application is doing from incoming requests to database queries and everything in between.

DASH by Datadog 2025 Keynote

At the 2025 DASH Keynote and be the first to experience Datadog's latest product innovations. This year, we're unveiling next-generation observability features, innovative ways to secure your AI workloads, and powerful agentic AI capabilities throughout the Datadog platform. Discover the new ways your teams can observe, secure, and act in the age of AI.

CI/CD Observability with OpenTelemetry - A Step by Step Guide

In the fast-paced world of CI/CD, understanding the performance and behaviour of your pipelines is crucial. GitHub Actions has become a popular choice for automating builds and deployments, but anyone who's debugged a flaky workflow or long-running job knows how challenging it can be to get visibility into what's happening under the hood. We usually rely on build logs, timing data, or guesswork when something goes wrong.

Top 13 Open Source APM Tools [2025 Guide]

Choosing the right APM tool is critical. How do you know which is the right one for you? Here are the top 13 open-source application performance monitoring(APM) tools that can solve your monitoring needs. Open-source APM tools have added benefits over their SaaS counterparts. They are more transparent, as you can verify their source code, and you can use them without going through the pains of obtaining approvals usually required for using a third-party vendor tool.

Top 15 Distributed Tracing Tools for Microservices in 2025

In one of our previous blogs, we discussed distributed tracing in depth. We examined why distributed tracing is critical and its components - spans and trace context. You can check the complete guide here: What is Distributed Tracing and How to Implement it with Open Source? Here, we'll look at some of the best distributed tracing tools. We'll see what each of them offers so that you can choose the right tool for your monitoring and observability requirements.

Mastering NodeJS Performance Monitoring - A Practical Guide using Open Source Tools

Node.js powers some of the fastest-growing web applications, but its single-threaded nature makes it vulnerable to memory leaks and CPU spikes. To keep your app running smoothly, especially in production, you need more than just web server logs — you need complete visibility across the entire stack.

How to Collect .NET Application Logs with OpenTelemetry

Observability is essential for maintaining and scaling modern applications. With.NET 8, Microsoft has enhanced support for observability using OpenTelemetry. In this post, we explore how to monitor.NET 8 applications logs with SigNoz, an open-source observability platform, using the OpenTelemetry Protocol (OTLP) exporter.

Monitoring ECS Metrics: A Guide for Developers and Operations Teams

For anyone leveraging cloud computing, Amazon Elastic Container Service (ECS) continues to provide a seamless solution for managing containerized applications. AWS Fargate takes this cloud-native architecture a step further by allowing you to run containers without servers or clusters. As a serverless offering for ECS, Fargate provisions compute capacity and scales it based on demand.
Sponsored Post

Smarter alerts using P75 for more signal and less noise

We've rolled out a new feature in Raygun Alerting that gives you more control over how you track and respond to performance regressions. Starting today, you can now use the 75th percentile (P75) as a filter option for page performance data in Real User Monitoring, such as Core Web Vitals and page load time, right alongside the default 'Average'. This option is available under the "Page/XHR performance change" condition and supports all the Web Vitals metrics we track: Let's break down why this matters, when you should use P75, and how it gives you better, faster insights into how real users are experiencing your site or app.

Java License Monitoring - Why you need to monitor your Java licenses and how to do so

Java license monitoring has now become an essential requirement for many organizations as Oracle’s recent licensing changes have made compliance mandatory, with increased risks of audits and higher Java licensing compliance costs. Once a free programming platform, Java now requires navigating a complex licensing framework, including employee-based models that tie costs to the size of a workforce. These changes significantly increase the risk of unbudgeted expenses for licensing violations.

Monitoring Backstage with OpenTelemetry:Closing the observability blind spot

‘One small step for a man, but a huge leap for developers’ — me, when I realised how to observe my Backstage with OpenTelemetry. Backstage is often the “portal” through which we manage all our other systems, but who watches the watcher? Recently, we gave a KubeCon Talk, highlighting that monitoring Backstage itself is critical. When Backstage isn’t observable, it becomes a blind spot in your infrastructure.