Latest Posts

Introducing Updog.ai: Real-time provider status from Datadog

Oct 21, 2025 By Brianne Bujnowski In Datadog

When external SaaS providers or cloud services degrade or go down, engineers often find themselves wondering if the issue they're encountering is local or more widespread. The answers they find are usually slow to surface, limited in detail, or entirely dependent on the provider's updates. Vendor-controlled status pages and third-party aggregators don’t provide the timely, independent visibility that's necessary to quickly and accurately identify the root cause of slowdowns.

Read Post

Datadog

Read more about Introducing Updog.ai: Real-time provider status from Datadog

Optimize HPC jobs and cluster utilization with Datadog

Oct 21, 2025 By Michael Cronk In Datadog

High-performance computing (HPC) environments support some of the most critical workloads in the world—from asset pricing models in financial institutions to molecular simulations in drug discovery. These workloads often span hundreds of thousands of cores, depend on specialized infrastructure such as GPUs, and run for extended periods. As a result, performance and efficiency are critical.

Read Post

Datadog

Read more about Optimize HPC jobs and cluster utilization with Datadog

Detect and map third-party outages with Datadog External Provider Status

Oct 21, 2025 By Brianne Bujnowski In Datadog

Modern applications depend on dozens of external cloud platforms, APIs, and SaaS services to function. But when those providers experience issues, engineers often spend valuable time asking a basic question: Is the problem with us or with them? Provider-maintained status pages are often slow to update, leaving teams waiting for confirmation while incidents escalate. This delay wastes valuable time, prolongs investigations, and risks customer trust.

Read Post

Datadog

Read more about Detect and map third-party outages with Datadog External Provider Status

Track, debug, and roll back changes with Version History for Synthetic Monitoring tests

Oct 17, 2025 By Lauren Zuniga In Datadog

A synthetic test is only useful if you can trust what it’s telling you. When one fails, the reason may not be obvious. Was the application updated? Did the test change? Or both? As more people contribute and refine the same test, it becomes harder to understand what changed or restore a working version. Without clear visibility into those updates, teams can spend more time tracking down the cause of a failure than resolving it.

Read Post

Datadog

Read more about Track, debug, and roll back changes with Version History for Synthetic Monitoring tests

A deep dive into Java garbage collectors

Oct 17, 2025 By Jean-Philippe Bempel In Datadog

Historically, developers have relied on languages like C and C++ for explicit control over memory allocation and deallocation. This approach can yield very low overhead and tight control over performance, but it also increases complexity and risk (e.g., memory leaks, dangling pointers, and double frees). This often results in runtime issues that are difficult to diagnose, which can become a drag on team velocity.

Read Post

Datadog

Read more about A deep dive into Java garbage collectors

Ingest OTLP metrics directly into Datadog with the new OTLP Metrics API

Oct 17, 2025 By Connor Ward In Datadog

Many organizations rely on OpenTelemetry (OTel) to standardize observability across distributed systems. These organizations are at varying stages of adoption and are implementing OTel in complex environments with diverse configurations. To support this range of use cases, Datadog offers many ways to use OpenTelemetry with Datadog.

Read Post

Datadog

Read more about Ingest OTLP metrics directly into Datadog with the new OTLP Metrics API

Monitor logs from Amazon EKS on Fargate with Datadog

Oct 15, 2025 By Justin Lesko In Datadog

Amazon EKS on Fargate is a managed service that reduces the operational overhead of maintaining a Kubernetes cluster by abstracting away the underlying infrastructure. In a serverless Fargate environment, each pod is assigned its own isolated compute resources; there is no direct host-level access.

Read Post

Datadog

Read more about Monitor logs from Amazon EKS on Fargate with Datadog

Manage and optimize your OCI costs with Datadog Cloud Cost Management

Oct 9, 2025 By Patrick Krieger In Datadog

Engineering teams need to deliver reliable, secure, and high-performing applications, all while keeping costs under control. But engineers often lack visibility into cloud cost data, relying on finance-driven reports that they receive only after the billing cycle closes. Without daily cost insights alongside observability data, they don’t know until it’s too late that an infrastructure change caused a significant cost increase.

Read Post

Datadog

Read more about Manage and optimize your OCI costs with Datadog Cloud Cost Management

How we use Datadog to get comprehensive, fine-grained visibility into our email delivery system

Oct 7, 2025 By Alexa Liaskovski In Datadog

Visibility into email performance is indispensable to any organization that counts on its ability to reach people through their inboxes, including Datadog. SREs, FinOps, and many other teams rely on email as a critical channel for communications from our platform, including monitor alerts, usage reports, and service account notifications. At Datadog, we depend on the visibility provided by our integrations for Mailgun, SendGrid, and Amazon SES to optimize our email performance and ensure deliverability.

Read Post

Datadog

Read more about How we use Datadog to get comprehensive, fine-grained visibility into our email delivery system

Instantly respond to changes in your data with Datadog automation rules

Oct 7, 2025 By Barak Shoushan In Datadog

Datadog Workflow Automation can automate processes and reduce the amount of time spent on time-consuming, repetitive tasks. You can trigger these workflows in real time by tying them to alerts, dashboards, Slack messages, and other signals.

Read Post

Datadog

Read more about Instantly respond to changes in your data with Datadog automation rules

Operations | Monitoring | ITSM | DevOps | Cloud

Introducing Updog.ai: Real-time provider status from Datadog

Optimize HPC jobs and cluster utilization with Datadog

Detect and map third-party outages with Datadog External Provider Status

Track, debug, and roll back changes with Version History for Synthetic Monitoring tests

A deep dive into Java garbage collectors

Ingest OTLP metrics directly into Datadog with the new OTLP Metrics API

Monitor logs from Amazon EKS on Fargate with Datadog

Manage and optimize your OCI costs with Datadog Cloud Cost Management

How we use Datadog to get comprehensive, fine-grained visibility into our email delivery system

Instantly respond to changes in your data with Datadog automation rules

Monthly Archive

Follow Us