Monthly Archive

Golden signals in seconds with Universal Service Monitoring

Nov 29, 2022 By Stephen Pinkerton In Datadog

Whether you are a site reliability engineer, DevOps engineer, or application developer, you need visibility into the health and performance of every service you run or support. But in complex, dynamic environments, it can be difficult to ensure that all services are accounted for.

Monitor your mobile apps with Embrace's offering in the Datadog Marketplace

Nov 29, 2022 By Thomas Sobolik In Datadog

Embrace is a mobile application monitoring solution that helps you track and troubleshoot mobile app performance by combining data analytics, real user monitoring, network performance monitoring, and hardware monitoring in a single platform. We’re pleased to partner with Embrace to offer an out-of-the-box Embrace Datadog app and software license in the Datadog Marketplace.

Announcing TISAX-compliant observability for the automotive industry and its suppliers

Nov 29, 2022 By Aaron Kaplan In Datadog

Many organizations face complex regulatory requirements when it comes to monitoring the health and performance of their service and application infrastructure. As part of our ongoing commitment to providing a comprehensive monitoring solution for all customers, we’re pleased to announce that Datadog has achieved TISAX Assessment Level 2 (AL2) certification.

Improve your EC2 rightsizing recommendations with Datadog and AWS Compute Optimizer

Nov 29, 2022 By Addie Beach In Datadog

While cloud solutions can give you greater flexibility as you scale your infrastructure, limited visibility into resource utilization makes provisioning the right amount of compute resources challenging. To ensure that every workload is fully supported, many organizations may opt to over-provision, which leads to overspending. Or, in an attempt to maximize cost savings, organizations may under-provision, leaving workloads unsupported and risking serious performance impacts.

Track and triage errors in your logs with Datadog Error Tracking

Nov 28, 2022 By Ayush Kapur In Datadog

Reducing noise in your error logs is critical for quickly identifying bugs in your code and determining which to prioritize for remediation. To help you spot and investigate the issues causing error logs in your environments, we’re pleased to announce that Datadog Error Tracking is now available for Log Management in open beta.

This Month in Datadog: Dash 2022 Recap

Nov 28, 2022 By Datadog In Datadog

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. This month, we take you straight to the Javits Center for a showcase of Dash 2022.

Mitigate cold starts in your Java Lambda functions with Datadog and AWS Lambda SnapStart

Nov 28, 2022 By Danny Driscoll In Datadog

AWS Lambda enables engineering teams to build modern, scalable services without the need to provision underlying infrastructure resources. But monitoring Lambda functions requires visibility into performance indicators that differ from those of traditional architectures—and cold starts are a key example.

Continuous Testing Demo

Nov 21, 2022 By Datadog In Datadog

Continuous Testing provides efficient, reliable testing that works seamlessly with the rest of your CI/CD pipelines. In this demo, we show you how you can use the Continuous Testing Explorer page, GitHub integration, and codeless web recorder to easily create useful tests and analyze the results.

Datadog acquires Cloudcraft

Nov 21, 2022 By Cansu Berkem In Datadog

A well-designed cloud architecture is essential to ensure that the underlying infrastructure stays operational, within budget, and compliant over time. These days, organizations are rapidly spreading their infrastructure across a broad, complex mesh of interconnected resources and services. It can be difficult to make high-level decisions about the design and management of these systems. This is why many organizations are now turning to cloud infrastructure modeling tools.

Architecting for Reliability

Nov 18, 2022 By Datadog In Datadog

As modern systems become increasingly complex, the risk of incidents and outages increases. Old approaches to reliability can sometimes be adapted to novel system designs, but other times new methods need to be invented. In this panel session moderated by Datadog’s Jason Yee, you’ll hear from SRE leaders and systems architects across the industry about how they’re designing and operating systems to achieve greater reliability.

Democratizing Observability

Nov 18, 2022 By Datadog In Datadog

DevOps principles have helped many organizations improve cross-team collaboration, which has in turn led to increased reliability and velocity in the development lifecycle. In this session moderated by Jason Yee, we hear from panelists who have applied these same DevOps principles to observability, helping them unlock data-based insights and empower teams to make smarter, more informed decisions.

RUM now offers React Native Crash Reporting and Error Tracking

Nov 17, 2022 By Aaron Kaplan In Datadog

React Native has become the predominant development framework for cross-platform mobile applications. By interacting with native APIs largely under the hood and requiring only a small fraction of platform-specific code, it allows you to build applications for iOS, Android, and the browser using the same declarative JavaScript. But this cross-platform adaptability has its downsides.

Dash Panel Discussion: What Users Really Want

Nov 17, 2022 By Datadog In Datadog

Measuring user experience is typically done by tracking metrics like latency and purchase frequency. But these metrics can often obscure real user sentiment. In this panel session moderated by Miranda Kapin, you can learn about better ways to uncover how users are truly experiencing your application and methods for improving their engagement.

Generate RUM-based metrics to track historical trends in customer experience

Nov 15, 2022 By Bowen Chen In Datadog

Datadog Real User Monitoring (RUM) provides end-to-end visibility into the user experience and performance of your browser and mobile applications. RUM allows you to capture and retain complete user sessions for 30 days. This means you can pinpoint bugs, prioritize issues, and determine fixes with data collected across an entire month.

Building a Multi-Tenant Insurance Platform

Nov 15, 2022 By Datadog In Datadog

In 2020, CoverWallet—a multi-tenant insurance platform—was acquired by Aon, which led to a rapid expansion in both the size and global presence of its engineering organization. In his talk, CoverWallet’s Hylke Alons walks through the changes that were necessary to meet their platform's new expectations, including improving growth and scalability while ensuring reliability, automating security, and reducing maintenance. He also discusses some best practices for scaling up engineering and product teams to handle demand in a complex and highly regulated industry like insurance.

Stress test your Kubernetes application with Speedscale's offering in the Datadog Marketplace

Nov 11, 2022 By Bowen Chen In Datadog

Properly testing a service’s APIs to ensure that it can handle production traffic presents many challenges for engineers—SREs need to guarantee the resiliency of their application, while developers must ensure that their features perform well at any given scale. Speedscale is a testing framework built for Kubernetes applications that enables you to load test with real-world production scenarios by replaying actual API traffic that your application has experienced.

Expanded Datadog Lambda extension capabilities with the AWS Lambda Telemetry API

Nov 10, 2022 By Jordan Obey In Datadog

In 2021, we partnered with AWS to develop the Datadog Lambda extension, which provides a simple, cost-effective way for teams to collect traces, logs, custom metrics, and enhanced metrics from Lambda functions and submit them to Datadog.

A practical guide to capturing production traffic with eBPF

Nov 10, 2022 By Guy Arbitman In Datadog

Monitoring HTTP sessions offers a potentially powerful way to gain visibility into your web servers, but in practice, doing so can be complex and resource-intensive. Extended Berkeley Packet Filter (eBPF) technology allows you to overcome these challenges, giving you a simple and efficient way to process application-layer traffic for your troubleshooting needs.

Changing Perspectives: A Deep Dive into the Security Posture of 600+ Real-World AWS Environments

Nov 8, 2022 By Datadog In Datadog

Earlier this year, Datadog released the “State of AWS Security” study, which examined real-world data from more than 600 organizations and AWS accounts to understand the security posture of global AWS users who also leverage the Datadog Cloud Security Platform. Join Datadog’s Christophe Tafani-Dereeper and Andrew Krug as they explore some important insights from this study, such as the top ways organizations are breached on AWS and how tooling like Datadog Cloud Security Posture Management can help.

Auditing Your Automation's Access: Using More Automation

Nov 8, 2022 By Datadog In Datadog

Between CI/CD pipelines, container orchestrators, and developer debugging tools, more and more automation is needed to scale your systems. But how do you know if that automation is accessing the right systems at the right time? And how do you ensure that your automation is safe from exploits by unauthorized users?

I've Made a Huge Mistake: Implementing Agile on Infrastructure Teams

Nov 8, 2022 By Datadog In Datadog

Bad planning methods can damage team morale and prevent teams from improving the systems they maintain. In this talk, Sam Handler from Shopify explains how his attempts to fix poor infrastructure planning processes through Agile methods failed. Drawing from this experience, he offers several principles that can help infrastructure teams improve the way they work.

Scaling Up, One Network Bottleneck at a Time

Nov 8, 2022 By Datadog In Datadog

Processing data at scale involves moving packets through a network—but what happens when that network isn't cooperative? Anatole Beuzon, a Software Engineer at Datadog, discusses how he investigated and resolved network issues in Datadog’s larger data-processing apps and how you can apply these same methods to your own production workloads.

Ask a Site Reliability Engineer (SRE)

Nov 8, 2022 By Datadog In Datadog

Site reliability engineering (SRE) can be complicated, and at Datadog, we’ve spent a lot of time thinking about SRE and refining how we implement it. Join Datadog’s Brandon West and Rick Mangi as they provide a brief overview of SRE and its core concepts. This video also contains a Q&A session from the live taping of this panel.

FinOps and Cloud Cost Optimization

Nov 8, 2022 By Datadog In Datadog

As companies scale, it’s become increasingly important to keep cloud cost management and optimization top of mind. In this talk, Yuval Yogev from Sygnia walks you through Sygnia’s optimization journey of cutting their total cloud costs in half. Yogev also shares insights into how you can optimize your own organization’s cloud usage and spend.

Deploying OpenTelemetry Organizationally: From Proof of Concept to In-Production at Scale

Nov 8, 2022 By Datadog In Datadog

Observability involves telling a coherent story about an entire system. Over the years, video streaming service Pluto TV has had to navigate many storytellers in terms of observability vendors, tools, and formats before settling on OpenTelemetry to analyze and compare features across its many destination platforms. During this presentation, you'll see how Bharathi Ramachandran—Engineering Manager at Pluto TV—used OpenTelemetry to implement his initial proof of concept and get his entire organization shipping observability data at scale.

New GKE dashboards and metrics provide deeper visibility into your environment

Nov 3, 2022 By Steve Harrington In Datadog

Google Kubernetes Engine (GKE) is a managed Kubernetes service that enables users to deploy and orchestrate containerized applications on Google’s infrastructure. Datadog’s GKE integration, when paired with our Kubernetes integration, has always provided deep visibility into the health and performance of your clusters at the node, pod, container, and application levels.

Monitoring MongoDB performance metrics (WiredTiger)

Nov 2, 2022 By Jean-Mathieu Saponaro In Datadog

This post is part 1 of a 3-part series about monitoring MongoDB performance with the WiredTiger storage engine. Part 2 explains the different ways to collect MongoDB metrics, and Part 3 details how to monitor its performance with Datadog. If you are using the MMAPv1 storage engine, visit the companion article “Monitoring MongoDB performance metrics (MMAP)”.
