PagerDuty

The Unplanned Show, Episode 25: Learning from incidents with Nora Jones

The incident is resolved. The service is restored. Now what? To dig into how teams can learn from incidents and improve resiliency, this episode features the author of "Chaos Engineering" (O'Reilly), creator of the "Learning From Incidents" community, and founder of Jeli.io (recently acquired by PagerDuty): the one, the only, Nora Jones.

Modernize your ITSM with the New PagerDuty Application for ServiceNow

We live in an always-on world, where things move fast and break often. Building stronger resilience is critical for operational efficiency and delivering great customer experiences. CIOs have heavily invested in ITSM solutions, but a centralized, queued approach is no longer meeting the needs of modern organizations when it comes to critical, customer-impacting issues.

Predictions for 2024 - Learn from PagerDuty's CIO and CISO!

Join us as we kick off the year with our leaders discussing their 2024 predictions. Automation and generative AI will continue to play a big role in everything a CIO and CISO do, so come and learn from PagerDuty’s CIO, Eric Johnson, and CISO, Heather Hinton, about their top predictions for 2024 and how best to adopt automation and generative AI into your department’s strategies.

Unlocking the Value of your Runbook Automation Value Metrics with Snowflake, Jupyter Notebooks, and Python

This blog was co-authored by Justyn Roberts, Senior Solutions Consultant, PagerDuty.

Automation has become an integral part of the modern organization's business practices. Oftentimes when folks hear “automation,” they think of it as a means to remove the manual aspect of the work and speed up the process; what rarely gets the spotlight, however, is the value and return automation can offer to an organization, a team, or even just one specific process.

APAC Retrospective, Part 2: Mobilise: From Signal to Action

Continuing our series on 2023 learnings from APAC, it’s increasingly evident that incidents in organisations are not a matter of ‘if’ but ‘when,’ regardless of their size or industry. Recently, the APAC region has been witnessing regulatory bodies taking stricter actions against major companies for subpar services, leading to substantial penalties.

Practitioners Share How They Remove the Fear of On-Call

Being on-call isn’t likely to be the most enjoyable aspect of a job. In fact, there might be a certain level of stress and fear among engineering teams about going on call: maybe the page will be missed, or maybe a page will come in at 2am and require troubleshooting a production issue for hours.

Episode 23: Zero-Downtime Updates with Todd Whitney

With limited error budgets and low user tolerance for maintenance windows, the ability to execute routine updates without a maintenance window is an increasingly important socio-technical capability. Hear from Todd Whitney, who recently spoke at HashiConf about how PagerDuty performs updates while upholding its promise to customers of taking zero maintenance windows.