Operations | Monitoring | ITSM | DevOps | Cloud

Improving Security Forensics in Kubernetes Environments

The success of Kubernetes has made monitoring and alerting more difficult for traditional Security Information and Event Management (SIEM) tools. Attend this live webinar to learn how implementing the right network security and compliance solution will improve the accuracy and completeness of security forensic monitoring and alerting when using Kubernetes.

Postmortems and Retrospectives (class SRE implements DevOps)

Even after a service has been restored, SREs still have a bit of work to do. In this video, Liz and Seth discuss the postmortem process that SREs follow. Blameless postmortems and retrospectives are key to learning from failures and preventing recurrence. You will learn about the importance of conducting a postmortem, strategies for conducting a blameless postmortem, and techniques for trending retrospectives across your entire organization to gain better insights to prevent service disruptions in the future.

Disruption Detector and Real Time Monitoring with Stackdriver (Cloud Next '18)

Aja built an interactive disruption detector panel for attendees at the Google I/O Conference to intentionally cause errors to happen to the system. This demo highlights the amazing real time monitoring feature of Stackdriver as it tracks all incoming errors and make things easier for developers to pinpoint the issue. Watch the video to learn more.

OpsRamp Webinar - OpsRamp + #ITSM: Incident Management For Superior Digital Performance

Manage your incident lifecycle with actionable insights so that you can prevent IT outages and reduce downtime. Proactive Monitoring. Drive system health, availability, and performance with policy-based monitoring for IT services hosted on data centers and public clouds.