November 2018

The Gospel of DevSecOps: Partnering to Love Thy Customer

Raise your hand if you’re a believer in DevOps. Thank you, I see that hand. As a former organizer of devopsdays Toronto (2014–2016), I drank the Kool-Aid early on, you could say. But thanks to the outstanding research of DevOps Research and Assessment (DORA) and numerous State of DevOps reports before it, DevOps culture is much less of a religious battle than it has been in the past.

AWS: Operations Health and Best Practices

The ITOps world is a harsh working environment where ITOps personnel are expected to minimize the business impact of incidents at all hours of the day—regardless of the impact to themselves or their families. As more companies undergo digital transformation, the number of alerts and interruptions flowing to IT first responders will continue to increase.

PagerDuty Launches New AWS Integrations for CloudWatch, GuardDuty, CloudTrail, and Personal Health Dashboard

As you may expect from a company founded by former Amazon employees, PagerDuty has been helping AWS users automatically turn any signal into the right insight and action for years. Our Amazon CloudWatch integration enables teams to proactively mitigate customer-impacting issues, which in turn allows organizations to innovate and scale both their AWS and hybrid environments with confidence.

Uptime During the Holiday Shopping Season

In the United States, it’s almost that time of year again when we count our blessings and give thanks. For retail workers, it’s also the time of year when they prepare for the onslaught of eager shoppers who waited hours in line to run into stores to get their hands on doorbuster deals (sometimes knocking down the employees in the process).

Modern Incident Response: The Definitive Guide

To meet the rising demands of customers, organizations are being forced to scale their operations in ways that introduce additional complexity and chaos. More people are involved in operations and in incident response, across an ever-increasing mix of systems, applications, tools, and layers of abstraction, resulting in more and more risk to the business.

When Every Minute Matters

Human trafficking is a $150 billion criminal industry that denies freedom to over 40 million people globally, and it happens in every country in the world. Polaris is an organization dedicated to ending human trafficking and restoring freedom to survivors. For over a decade, Polaris has operated the U.S. National Human Trafficking Hotline.

PagerDuty API Introduction

Learn how easy it is to get up and running with the PagerDuty API in just a few minutes. Harness automation in your incident response and digital operations by leveraging PagerDuty’s REST-based API. This video covers basic concepts regarding APIs, REST, and JSON. You will also be introduced to PagerDuty’s industry-leading interactive API documentation, which automatically provides executable API code at your fingertips.

Observability-Driven Development

TDD is table stakes for any good team, but it’s not enough. These days you need ODD: Observability-Driven Development (and Design). Observability should be baked into every step of your software development process, from conception through maintenance. No pull request should ever be accepted without being able to answer the question, "how will you know if this works?"

5 Best Practices for Resolving Errors Quickly

I love writing software, but I hate dealing with bugs. They take you away from what you want to be doing and often lead you into a rabbit hole. At Sentry—an open-source error tracking platform that provides complete app logic, deep context, and visibility across the entire stack in real time—we have a few tips that we’ve honed over time to make error resolution painless (ok, less painful), including an official integration with PagerDuty.

Incidents as we Imagine Them Versus How They Actually Are with John Allspaw

There is a tendency to imagine (or remember!) incidents as unfolding in a much neater and more orderly way than they actually do. Events can leave some engineers scratching their heads about what is happening, while their teammates are instead confused about how it's happening.

Real-Time Operations Maturity: How Businesses Can Thrive in the Digital Era

It’s rare to find a business today that doesn’t rely on digital technologies and services. Retail is one example: Whether customers are buying online or in store, completing a transaction requires a website or point-of-sale system. The entire supply chain relies on IT services to deliver goods on time and to the right locations. And just like any company today, every department, from development and marketing to HR and business services, has a critical tech stack.