Operations | Monitoring | ITSM | DevOps | Cloud

March 2019

Shadow Like A Dutonian: Onboarding Engineers With On-Call Shadowing

On-call shadowing is an essential practice at PagerDuty. For a new engineer, a shadowing period serves as a kinder, smoother ramp-up to going on-call, with none of the stress or responsibility for diagnosing and fixing the issue. When we configure shadowing in PagerDuty, our goal is to simulate the process and actions of going on call as precisely as we can while making sure that actions of the “Shadow User” do not affect the primary engineer who is actually on call.

Leveraging Innovation To Accelerate Social Impact

We’ve heard it time and again: Digital transformation is happening across all industries and business is booming. Decades-old companies are migrating to the cloud, deploying new mobile features regularly, and adopting new technologies at dizzying rates, all in the name of increasing revenue.

Don't Be a Bystander, Be an Incident Commander

Many organizations have some kind of incident response process to coordinate during a major service outage. Some operationally mature companies incorporate a formal Incident Commander role in their process for a faster, more effective response. The Incident Commander serves as the final decision-maker during a major incident, delegating tasks and listening to input from subject matter experts in order to bring the incident to resolution.

Setting Up Your PagerDuty For Sweet Victory

Congratulations! You’ve just purchased PagerDuty, meaning you’ve decided to make an investment in your incident management process. However, in order to maximize your investment, you will need to understand all the moving pieces within PagerDuty. Today, we’ll be setting up PagerDuty for one team: the Bikini Bottom Team.

Cut Through Complexity With Better Event Intelligence

As operational complexity accelerates, our customers are realizing that it’s impossible to manage their services or innovate for their business without a mechanism to make sense of that complexity. That’s why our March product update focuses on Event Intelligence, which is all about turning chaotic monitoring data into actionable insights so that teams can work smarter and focus on the things that matter.

Postmortems Part 3: Getting The Most Out Of Your Postmortem Meetings

When we announced the launch of our Postmortem Guide, I wrote about the value of performing blameless postmortems and how to establish a culture of continuous learning. In this final installment of our blog series on postmortems, I share how to have effective postmortem meetings.

Agility In Everyday Activities

It’s already a couple of months into the new year, but a lot of us are still likely thinking about what we can improve on. You can call it resolutions, goals, or something else, but many of us will be fine-tuning our mindsets and attitudes so we can be more productive and successful in 2019. In this blog, I’ll detail why I developed an Agile approach to goals for the year—and how adopting an Agile mindset can help you achieve yours.

The Four Agreements Of Incident Response

Have you ever been on one of those phone calls with several other human beings where you’re all almost screaming at each other while trying to troubleshoot an issue when something’s going wrong that needs to be fixed right this instant? Did you really enjoy that experience and want to do it all the time? My guess is no.