Operations | Monitoring | ITSM | DevOps | Cloud

Latest Videos

Security and Monitoring with Istio - Take5

Kubernetes can be a great orchestration platform for your microservices, but these services can grow complex and difficult to manage. Enter Istio, an integrated way to create a network of your services and manage load balancing, authentication, and more! Join us as we walk you through Istio and several ways it can help you wrangle your applications.

Postmortems and Retrospectives (class SRE implements DevOps)

Even after a service has been restored, SREs still have a bit of work to do. In this video, Liz and Seth discuss the postmortem process that SREs follow. Blameless postmortems and retrospectives are key to learning from failures and preventing recurrence. You will learn about the importance of conducting a postmortem, strategies for conducting a blameless postmortem, and techniques for trending retrospectives across your entire organization to gain insights that help prevent future service disruptions.

Disruption Detector and Real Time Monitoring with Stackdriver (Cloud Next '18)

Aja built an interactive disruption detector panel that lets attendees at the Google I/O Conference intentionally cause errors in the system. This demo highlights the real-time monitoring feature of Stackdriver, which tracks all incoming errors and makes it easier for developers to pinpoint the issue. Watch the video to learn more.

Incident Management (class SRE implements DevOps)

In the previous video, Liz and Seth discussed how to make systems observable and how observability helps us diagnose failing systems, but didn't cover what to do when an incident grows beyond the ability of one person to do it all. In this video, you learn about the most important part of the incident management process – humans.

Cloud OnAir: CE TV: Application Observability with LightStep

Observability remains a key challenge as customers embrace DevOps. Join Daniel "Spoons" Spoonhower, the CTO and Founder of Lightstep, a Google Cloud customer, and Yuri Grinshteyn, a Google Cloud Customer Engineer, to learn how Lightstep was built on Google Cloud to enable you to monitor what matters most and diagnose anomalies within seconds across web, mobile, monoliths, and microservices.