Operations | Monitoring | ITSM | DevOps | Cloud

Latest Videos

SRE in Transition: From Startup to Enterprise

"Startups are defined by “ship or die”. As a result, SRE teams at a startup should be focused on enabling product engineers to ship features as quickly as possible. As your startup transitions from “we’ll run out of money in the next 18 months” to “we have more than 1000 engineers”, how should the SRE organization evolve and provide the best value through that transition (including booting one up if you don’t have one)? I will discuss specific ways the organization needs to evolve to meet this challenge, how the SRE org can advocate for and support this change (both in direct actions and in “influence”), and how the overhang of startup technical and cultural debt can make this shift more challenging (but also more necessary).

From On-call to Non-call: Resolving Incidents Before They Even Happen

Artificial intelligence has captured the attention of the world, with tools like ChatGPT and large language models (LLMs) driving the conversation. But you don’t need to wait for the future or new features powered by LLMs to start working smarter—the tech industry has been investing in intelligent, automated tools for years and they’re ready for production now. In this talk, you’ll learn how the engineering teams at Toyota Connected use tools like Datadog Watchdog, Anomaly Detection, and Workflows to make our lives easier and keep our platform stable.

From Solution to Startup

Before Datadog was a widely adopted SaaS platform, it was a tool developed to solve our founders’ own monitoring needs. As technology-oriented people, we often build solutions for our own problems, then discover those problems are widespread. But how do you know when your solution should be something more? In this panel session, we’ll talk with tech startup founders to hear their stories and advice for turning tools into businesses.

Container Security Fundamentals - Linux Namespaces (Part 3): The Network Namespace

In this video, we continue our examination of Linux namespaces by looking at some details of how the network namespace can be used to isolate a container’s view of network resources, and how this feature can be used for troubleshooting container problems.

Datadog Service Catalog Demo

See what it’s like to have a central hub for all service knowledge alongside real-time observability data, including ownership, reliability, performance, and security, all in one place. With Service Catalog, you can not only better evaluate your system’s production readiness and adherence to industry best practices at scale with Scorecards, but also better understand the interrelationships between different microservices, or capture the cascading dependencies, between services and teams.

This Month in Datadog: Data Streams Monitoring, OpenAI Integration, CoScreen V5, and more

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. This month, we put the Spotlight on Data Streams Monitoring..

This Month in Datadog: DASH 2023, In-App WAF and User Protection, Cloudcraft for Azure, and more!

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. This month, we put the Spotlight on DASH 2023..