Every team should be A/B testing

Apr 17, 2026 By Ryan Lucht In Datadog

Technical teams want to know the newest, most cutting-edge tools they can implement to give themselves a competitive advantage, whether it’s the latest developer framework or modern CI/CD practices that boost velocity. But there’s one tool from all the way back in the 1920s that can improve any organization, no matter its scale: the randomized, controlled trial—or simply put, experiments.

Route OTel data from AI apps to ClickHouse and Datadog using Observability Pipelines

Apr 16, 2026 By Micah Kim In Datadog

As organizations continue to heavily invest in AI and build more agentic workflows, their telemetry data volumes can surge quickly, and the associated costs can become unpredictable. To regain control of their data, many AI-forward teams are turning to high-throughput, low-latency pipelines to collect and route data to tools such as OpenTelemetry (OTel) and ClickHouse. But these self-hosted solutions come with drawbacks.

Manage service tracing across hosts with Single Step Instrumentation rules

Apr 16, 2026 By Sarjeel Yusuf In Datadog

Single Step Instrumentation (SSI) simplifies Datadog Application Performance Monitoring (APM) by automatically discovering and instrumenting services across a host. For many teams, SSI is the ideal starting point because it helps them achieve full visibility with minimal setup. However, as environments grow, teams often want more control over which services get traced. Auxiliary workloads such as batch jobs and cron tasks might not require distributed tracing.

Offline evaluation for AI agents: Best practices

Apr 14, 2026 By Tom Sobolik In Datadog

If you’re building LLM-powered applications and agents, you’ve probably asked yourself: “How do I know if my changes actually made things better?” You can tweak prompts, adjust temperature settings, or try different models, but it’s not always easy to validate whether version B’s response is better than version A’s. Most teams fly blind in preproduction and rely on user feedback to see how well their application works in the real world.

Platform engineering metrics: What to measure and what to ignore

Apr 9, 2026 By Candace Shamieh In Datadog

Platform engineering teams have access to hundreds of metrics, yet over 40% of platform initiatives cannot demonstrate measurable value within the first year. Teams that cannot quantify their impact fail to obtain executive sponsorship, risk being defunded, and ultimately face deprecation. To accurately calculate a platform’s ROI, platform engineering teams need to differentiate between signals that measure platform effectiveness and those that should be used solely for investigative purposes.

Integrate Recorded Future threat intelligence with Datadog Cloud SIEM

Apr 9, 2026 By Shreya Batra In Datadog

Recorded Future provides real-time threat intelligence about indicators of compromise (IOCs), including malicious IP addresses, domains, and vulnerabilities. It also adds context on threat actors and campaigns to help security teams understand which signals represent real risk and prioritize their responses accordingly.

Instrument and monitor Boomi integration flows with OpenTelemetry and Datadog

Apr 9, 2026 By Massimo Sporchia In Datadog

Boomi is an Integration Platform as a Service (iPaaS) used by thousands of organizations to connect applications, data, and workflows across cloud and on-premises environments. Business-critical processes, from order fulfillment pipelines to customer data synchronization, depend on Boomi Atoms and Molecules running reliably.

Not all index scans are equal: How we cut query latency by over 99%

Apr 9, 2026 By Nenad Noveljic In Datadog

When engineers investigate SQL queries, they normally think of index scans as a fast and efficient step in the query’s execution plan. When executed correctly, they fetch only the relevant rows from your table as opposed to sequential scans that read the entire table, reducing latency and query costs. However, just because an execution plan uses an index scan doesn’t mean that the scan is fast or performant.

Operating agentic AI with Amazon Bedrock AgentCore and Datadog LLM Observability: Lessons from NTT DATA

Apr 7, 2026 By Tohn Furutani In Datadog

This guest blog post is by Tohn Furutani, SRE Engineer at NTT DATA. Over the past year, the conversation around generative AI has shifted from single-shot use cases—such as summarization, Q&A, and chat interfaces—to agentic AI systems that can make decisions based on context, plan multistep actions, invoke tools, and adapt as conditions change.

Practical AI-Enabled Observability for Agents and LLMs

Apr 7, 2026 By Datadog In Datadog

You’re told to “go build agents” without clear guidance on what that actually means, how to do it well, or how to know if it is working. You are not a data scientist. You are a software engineer. In this talk, Datadog AI product leader Shri Subramanian breaks down what changes when you move from building applications to building AI agents, and why familiar approaches like traditional testing and linear delivery fall short. The talk explores how agent development shifts the focus from code alone to data, prompts, and evaluation, and why functional reliability matters just as much as operational reliability.
