Demo Roundup: PagerDuty Operations Cloud for Kubernetes
In this demo, Corbin Mills shows how to use the PagerDuty Operations Cloud to streamline and automate how a node failure is resolved. You’ll see how he uses event orchestration (in PagerDuty AIOps) to enrich an alert with pod names, and automatically runs a job to check the Kube API status, so that a responder has instant context. AIOps is also grouping and suppressing alerts. Then you’ll see how the responder can run more health status checks without the need to SSH into the environment or interrupt a co-worker for access. You’ll also see how the responder uses AI-generated draft of a status update to quickly inform stakeholders, incident workflows to automate several communications channels, and an AI-generated incident postmortem after the incident is resolved. Finally, you’ll see how those learnings are applied to make the second occurrence a self-resolving incident.
Subscribe to our channel: https://bit.ly/3BNQYNS
Subscribe to our Twitch channel to tune into these Demo Roundup episodes live: https://www.twitch.tv/pagerduty
Follow us on Social:
Instagram: https://bit.ly/3xf96g8
Facebook: https://bit.ly/3zCUM2g
Twitter: https://bit.ly/3f4PC7p
LinkedIn: https://bit.ly/3f3Ex6R
#Kubernetes #AIOps #IncidentResponse #ProcessAutomation