Operations | Monitoring | ITSM | DevOps | Cloud

How we use RUM to make design decisions that enhance user experience

Before we started using Datadog Real User Monitoring (RUM), we relied on frontend logging to gather data about the user experience. Logs gave us some helpful information about exceptions and errors but didn't provide any insight into issues directly related to the user’s perspective.

Monitoring AI Proxies to optimize performance and costs

Businesses deploying LLM workloads increasingly rely on LLM proxies (also known as LLM gateways) to simplify model integration and governance. Proxies provide a centralized interface across LLM providers, govern model access and usage, and apply compliance safeguards for smoother operations and reduced complexity—making LLM usage more consistent and scalable.

Introducing the Datadog Developer Hub

Finding the right integrations, libraries, and open source tooling to extend a product has long been a challenge for developers. While Datadog has a vast offering of monitoring and observability solutions, many teams need to customize their setup in some way—whether by extending the Datadog Agent, integrating with third-party services, or using SDKs to interact with the Datadog API.

Optimize cross-platform mobile apps with Datadog RUM and Kotlin Multiplatform support

Mobile developers are increasingly adopting Kotlin Multiplatform to share business logic across iOS and Android. While Kotlin Multiplatform reduces duplication of code-writing efforts, it also introduces blind spots. Developers often lack real-time visibility into how shared code performs across platforms, making it harder to troubleshoot issues and monitor user experience.

The Datadog Agent: Why it's essential for monitoring your infrastructure and applications with Datadog

If you’re a Datadog customer, you’re likely using our platform to gain visibility into your infrastructure and applications and to troubleshoot using logs, metrics, and traces when issues arise. To support these efforts, you’ll want access to the most granular telemetry signals and intuitive workflows that streamline your investigation.

3 ways to drive software delivery success with Datadog DORA Metrics

Delivering software quickly and reliably is the main focus of modern DevOps. But to improve your delivery performance, you need to understand it, and that starts with measurement. Teams primarily measure performance in this area by using DORA metrics—deployment frequency, change lead time, change failure rate, and time to restore service*. These metrics help teams understand trends in their software delivery practices in quantifiable terms that they can track and improve over time.

Unify your FinOps and engineering workflows in Datadog Cloud Cost Management

As your applications scale across cloud and SaaS providers, allocating costs and optimizing workloads become increasingly important—and challenging. Without access to cost data in their daily workflows, engineering teams can’t easily understand the cost of their resources and identify where they can reduce their spend. And while FinOps teams have access to cost data, they often review this information in silos.

Key metrics for monitoring Airflow

Airflow is a popular open source platform that enables users to author, schedule, and monitor workflows programmatically. Airflow helps teams run complex pipelines that require task orchestration, dependency management, and efficient scheduling across many different tools. It’s particularly useful for creating data processing pipelines, orchestrating task-based workflows such as machine learning (ML) training, and running cloud services.