%term

Define, run, and scale custom LLM-as-a-judge evaluations in Datadog

Nov 25, 2025 By Rashel Hoover In Datadog

Teams deploying LLM applications face a critical blind spot: They can measure speed and cost, but not whether their AI is actually giving good answers. To build user trust in these applications, teams also need to measure response quality, including factual accuracy, safety, and tone. Operational metrics show how a system behaves, but not whether its responses are correct or on brand.

Read Post

Datadog

Read more about Define, run, and scale custom LLM-as-a-judge evaluations in Datadog

Build custom apps in seconds with conversational AI in App Builder

Nov 24, 2025 By Barak Shoushan In Datadog

Datadog App Builder is a low-code tool for creating internal apps, making use of a drag-and-drop interface that allows engineering teams to troubleshoot issues, optimize operations, and enable self-service while connecting directly to their Datadog data and permissions. Now, with conversational AI, teams can go from idea to working prototype even faster.

Read Post

Datadog

Read more about Build custom apps in seconds with conversational AI in App Builder

Introducing Bits AI SRE, your AI on-call teammate

Nov 24, 2025 By Datadog In Datadog

Bits AI SRE is your AI on-call teammate, built to autonomously investigate alerts and coordinate incident response. Integrated with Datadog, Slack, GitHub, Confluence, and more, Bits analyzes telemetry, reads documentation, and reviews recent deployments to determine the root cause of alerts—often before you’ve even opened your laptop. In fact, if you're using Datadog On-Call, you can view Bits’s findings right from your phone—so you’re always one step ahead, no matter where you are.

View Video

Datadog

Read more about Introducing Bits AI SRE, your AI on-call teammate

Data Observability: Build confidence in the data life cycle

Nov 21, 2025 By Datadog In Datadog

Datadog Data Observability provides a complete solution with quality checks (e.g., volume, row changes, freshness), custom SQL-based monitors, anomaly detection, column-level lineage across systems like Snowflake and Tableau, full pipeline visibility, and targeted alerts when data issues arise.

View Video

Datadog

Read more about Data Observability: Build confidence in the data life cycle

Coordinate large-scale engineering initiatives with IDP Campaigns

Nov 21, 2025 By Amrita Lakhanpal In Datadog

As organizations grow, engineering leaders often need to drive cross-team initiatives such as reducing cloud spend, upgrading runtimes, or strengthening security controls. Tracking this work can quickly become fragmented across spreadsheets, dashboards, and status meetings. Progress is hard to measure, accountability is unclear, and the impact of each effort can be difficult to demonstrate.

Read Post

Datadog

Read more about Coordinate large-scale engineering initiatives with IDP Campaigns

Use OpenTelemetry with Observability Pipelines for vendor-neutral log collection and cost control

Nov 20, 2025 By Micah Kim In Datadog

Today, many DevOps and security teams operate in a world of complex, hybrid, or multi-vendor environments. As more teams look to avoid lock-in by adopting open standards, OpenTelemetry (OTel) is quickly gaining adoption as the primary open source method for DevOps and security teams to instrument and aggregate their telemetry data. However, OTel alone may lack the advanced processing functions, native volume control rules, and hybrid environment support that large organizations need.

Read Post

Datadog

Read more about Use OpenTelemetry with Observability Pipelines for vendor-neutral log collection and cost control

How Datadog Feature Flags is resilient to cloud provider failures

Nov 19, 2025 By Anthony Rindone In Datadog

As major incidents like AWS’s October 2025 outage illustrate, modern systems are immensely interconnected. A failure in one can lead to a cascade of downstream problems. In this case, issues with DNS resolution for DynamoDB led to widespread disruptions with other AWS services and, subsequently, thousands of applications and services that rely on that infrastructure.

Read Post

Datadog

Read more about How Datadog Feature Flags is resilient to cloud provider failures

Explore Cloud Instance Pricing and Performance with Datadog Instance Explorer

Nov 19, 2025 By Datadog In Datadog

Meet Datadog Instance Explorer — a way to explore, compare, and monitor cloud instance pricing and performance across AWS, Azure, and Google Cloud in one place. In this quick overview, you’ll learn how to: Start exploring your instance options today and make smarter, data-driven infrastructure decisions.

View Video

Datadog

Read more about Explore Cloud Instance Pricing and Performance with Datadog Instance Explorer

Introducing Datadog Agent Builder: Build agentic workflows for alert response and remediation

Nov 18, 2025 By Amber Tunnell In Datadog

Building automated workflows that adapt to real-world complexity can be a challenge. As systems scale and scenarios multiply, teams often end up hardcoding endless logic branches just to handle every potential outcome. That’s why we’re introducing Datadog Agent Builder, a powerful new tool that lets you create custom AI agents that are fully hosted by Datadog.

Read Post

Datadog

Read more about Introducing Datadog Agent Builder: Build agentic workflows for alert response and remediation

Datadog GPU Monitoring: Optimize and troubleshoot AI infrastructure

Nov 18, 2025 By Datadog In Datadog

With Datadog GPU Monitoring, engineering and ML teams can monitor GPU fleet health across cloud, on-prem, and GPU-as-a-Service platforms like Coreweave and Lambda Labs. Real-time insights into allocation, utilization, and failure patterns make it easy to spot bottlenecks, eliminate idle GPU spend, and resolve provisioning gaps. By tying usage metrics directly to cost and surfacing hardware and networking issues impacting performance, Datadog helps teams make fast, cost-efficient decisions to keep AI workloads running reliably at scale.

View Video

Datadog

Read more about Datadog GPU Monitoring: Optimize and troubleshoot AI infrastructure

Operations | Monitoring | ITSM | DevOps | Cloud

Define, run, and scale custom LLM-as-a-judge evaluations in Datadog

Build custom apps in seconds with conversational AI in App Builder

Introducing Bits AI SRE, your AI on-call teammate

Data Observability: Build confidence in the data life cycle

Coordinate large-scale engineering initiatives with IDP Campaigns

Use OpenTelemetry with Observability Pipelines for vendor-neutral log collection and cost control

How Datadog Feature Flags is resilient to cloud provider failures

Explore Cloud Instance Pricing and Performance with Datadog Instance Explorer

Introducing Datadog Agent Builder: Build agentic workflows for alert response and remediation

Datadog GPU Monitoring: Optimize and troubleshoot AI infrastructure

Monthly Archive

Follow Us