Operations | Monitoring | ITSM | DevOps | Cloud

Latest Videos

Scaling Down Kubernetes Clusters

Datadog, the observability platform used by thousands of companies, runs on dozens of self-managed Kubernetes clusters in a multi-cloud environment, adding up to tens of thousands of nodes, or hundreds of thousands of pods. This infrastructure is used by a wide variety of engineering teams at Datadog, with different feature and capacity needs.

Provisioning and Autoscaling

Datadog, the observability platform used by thousands of companies, runs on dozens of self-managed Kubernetes clusters in a multi-cloud environment, adding up to tens of thousands of nodes, or hundreds of thousands of pods. This infrastructure is used by a wide variety of engineering teams at Datadog, with different feature and capacity needs.

Paving the Road for Proactive Reliability

At Expedia Group, Kaushik Patel and Nikos Katirtzis have thousands of engineers and micro-services. Heterogeneity in terms of infrastructure and technologies used over the years created inefficiencies and posed the need for a set of automated best practices for our engineering teams. Over the past 2 years, using a data-driven approach, we’ve worked on creating a set of platforms that helps teams to adopt good reliability practices, including chaos engineering, release safety, or automatic failover between cloud regions. In this talk Kaushik and Nikos will cover the platforms they’ve built, including how they used data to drive their investment decisions.

How Toyota is using Datadog and AI/ML to invent new ways for humans to be more mobile #datadog

Toyota is best known for making great cars and trucks, and as a leader in technology and mobility, they are on a mission to build a better future where everyone has the freedom to move. By partnering with Datadog, Toyota is taking advantage of the latest AI/ML to innovate and invent new ways for humans to be more mobile, while future proofing Toyota’s tech stack.

Scaling Up, One Network Bottleneck at a Time #shorts #datadog

Processing data at scale involves moving packets through a network—but what happens when that network isn't cooperative? Anatole Beuzon, a Software Engineer at Datadog, discusses how he investigated and resolved network issues in Datadog’s larger data-processing apps and how you can apply these same methods to your own production workloads.

Building an Internal Development Platform (IDP): A Journey of Innovation and Growth #shorts

As your organization grows, the increased number of engineers and services can put a strain on your infrastructure and ops teams. As Latin America’s largest online commerce and payments ecosystem, MercadoLibre needed to solve this scaling challenge. So we embarked on a mission to build an Internal Development Platform (IDP). We’ll highlight our transformative journey and how the IDP grew to manage over 26,000 microservices, while delivering a highly productive environment to MercadoLibre’s 12,000+ developers. In this session, you’ll learn about the challenges and solutions required to successfully build your own IDP.