Search and analyze unsampled logs in real time with Live Tail

With thousands of logs generated every minute by your infrastructure, applications, services, and devices, retaining all of this data for active search and analysis can be cost-prohibitive. Because log volumes continue to grow rapidly as operations scale, organizations commonly adopt log management strategies that limit how much data they store in order to minimize costs.

Why Context Matters: Mastering Serverless App Monitoring

Hi there, and welcome to the second video in this series on observing AWS serverless applications with Datadog. In this video, you’ll learn why it is important to add custom business context to the telemetry you send to Datadog and how you can use that context in APM to quickly diagnose and debug issues. You’ll walk away with an understanding of the importance of distributed tracing, as well as how to add specific business context to your telemetry.

Instrument Google Cloud Run applications with the new Datadog Agent sidecar

Google Cloud Run is a fully managed service that allows you to deploy, manage, and scale workloads on serverless containers. Because Cloud Run abstracts away infrastructure management and runs on complex, distributed backends, it can be difficult to troubleshoot. Datadog’s integrations with Google Cloud and Google Cloud Run address that challenge by collecting and visualizing key metrics and logs.

How Forbes delivers a premium digital experience with Datadog

Learn how Forbes, a global media powerhouse, successfully migrated to the cloud with Datadog. Discover how they enabled their teams across their entire tech stack to access IT data and make critical improvements. The team maintained a 99.5 percent uptime through proactive alerting and improved root cause analysis by 10 percent.

Increase control and reduce noise in your AWS logs using Datadog Observability Pipelines

Today’s SRE and security operations center (SOC) teams often find themselves overwhelmed by the sheer volume and variety of logs generated by critical AWS services such as VPC Flow Logs, AWS WAF, and Amazon CloudFront. While these logs can be valuable for detecting and investigating security threats, as well as troubleshooting issues in your environment, managing them at scale can be challenging and costly.

A deep dive into Database Monitoring index recommendations

Datadog Database Monitoring (DBM) Recommendations help you proactively optimize performance throughout your database fleet. DBM draws on a wide range of data sources in order to detect and provide actionable guidance on issues such as blocking queries, low disk space, and missing indexes. In this post, we’ll show you how DBM formulates targeted indexing recommendations to help you optimize database performance.

How to use locators to design more resilient synthetic tests

Most modern web applications are frequently updated to implement new features, execute marketing campaigns, or enhance their UX with new libraries or APIs. While this helps you better engage your users, constant UI updates make designing flexible, long-lasting tests challenging.

Observability for your NodeJS AWS Serverless Applications

Hi there, and welcome to the first video in this series on observing AWS serverless applications with Datadog. In this video, you’ll learn how easy it is to get started observing your serverless Node.js applications using Datadog and the AWS CDK. You’ll also look at how you can use the Datadog console to diagnose latency issues and errors in your application. You’ll walk away with an understanding of how to instrument your Lambda functions with the AWS CDK, as well as practical steps you can take to debug your applications.

Improve developer experience and collaboration with Software Catalog

As software ecosystems grow more complex and fragmented, organizations are finding it harder to manage the thousands of interdependencies that make up their environments. For starters, engineers are collectively struggling to uphold security and reliability standards throughout their organizations because they lack a shared view of these complex software landscapes.