Datadog on Kubernetes Monitoring

Datadog on Kubernetes Monitoring

Nov 18, 2020

With many blog posts published and talks given on the topic, it’s no secret that Datadog is running Kubernetes at scale. We currently run dozens of clusters, some of them with thousands of nodes. Additionally, we have clusters running in multiple clouds. How are we monitoring all of that, ensuring we can scale up quickly and safely?

In this session Ara Pulido, Technical Evangelist, will chat with Celene Chang and Charly Fontaine - both software engineers on the Container Integrations team at Datadog. This team is responsible for deploying and running the Datadog Agent in our Kubernetes clusters. We’ll cover how we are running the Datadog Agent in our clusters, which metrics we care about, and the monitors we have set up. By the end of the session you will have new ideas and best practices on monitoring Kubernetes with Datadog that you can apply in your own environment.

Links mentioned in the talk
ExtendedDaemonset Github: https://github.com/DataDog/extendeddaemonset
Watermark Pod Autoscaler Github: https://github.com/DataDog/watermarkpodautoscaler
How to monitor Kubernetes audit logs: https://www.datadoghq.com/blog/monitor-kubernetes-audit-logs/
Explore Kubernetes resources with Datadog Live Containers: https://www.datadoghq.com/blog/explore-kubernetes-resources-with-datadog/