Incident Management

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

SRE Leaders Panel: SRE Adoption as Organizational Transformation

Apr 6, 2021 By Blameless Community In Blameless

Blameless recently had the privilege of hosting SRE leaders Kurt Andersen, SRE Architect at Blameless, Vanessa Yiu, Executive Director, Enterprise Architecture at Goldman Sachs, and Tony Hansmann, Former Global CTO at Pivotal Software, Inc.

Read Post

Blameless

Read more about SRE Leaders Panel: SRE Adoption as Organizational Transformation

Shifting Security Left: Tools and Best Practices

Apr 6, 2021 By OnPage Corporation In OnPage

Software development pipelines typically cycle through key four processes—design, development, testing and software or update releases. Traditional pipelines perform quality and security tests only after completing the development phase. Since there is no such thing as a perfect code, there are always issues to fix. However, if significant architectural changes are needed, fixing them at the end of the process can be highly expensive.

Read Post

OnPage

Read more about Shifting Security Left: Tools and Best Practices

So you Want an SRE Tool. Do you Build, Buy, or Open Source?

Apr 5, 2021 By Emily Arnott In Blameless

As your organization’s reliability needs grow, you may consider investing in SRE tools. Tooling can make many processes more efficient, consistent, and repeatable. When you decide to invest in tooling, one of the major decisions is how you’ll source your tools. Will you buy an out-of-the-box tool, build one in-house, or work with an open source project? This is a big decision. Switching methods half-way through adoption is costly and can cause thrash.

Read Post

Blameless

Read more about So you Want an SRE Tool. Do you Build, Buy, or Open Source?

Behind the redesign of Spike.sh On-call

Apr 5, 2021 By Rajni Reddy In Spike

Background We recently released the biggest overhaul to one of the core features of Spike.sh - On-call schedules. Software teams use on-call schedules to designate first responders who will handle issues when they occur.

Read Post

Spike

Read more about Behind the redesign of Spike.sh On-call

5 Ways Unplanned Work Is Disrupting Your Business

Apr 5, 2021 By Steve Barrett In PagerDuty

Unplanned work is rising, with consequences ranging from unhappy customers and lost revenue, to employee churn and burnout. So what is the true business cost of wasted time? In this blog, we will explore how one employee’s wasted time can impact the whole company—from operations, to development and beyond.

Read Post

PagerDuty

Read more about 5 Ways Unplanned Work Is Disrupting Your Business

Product Update: Upgrade to Exporting your Retrospectives

Apr 2, 2021 By Blameless Community In Blameless

Blameless is excited to announce an enhancement to our Incident Retrospective tool! The Export feature now allows for customizable retrospectives.

Read Post

Blameless

Read more about Product Update: Upgrade to Exporting your Retrospectives

How SREs Can React to COVID-19's Impact on Incident Management

Apr 2, 2021 By Quentin Rousseau In Rootly

By adding new complexity to reliability engineering and making physical war rooms a thing of the past, COVID-19 has imposed permanent changes on incident management. Here’s how SREs can respond.

Read Post

Rootly

Read more about How SREs Can React to COVID-19's Impact on Incident Management

Enabling Customer Service With Full Visibility Into Customer-Impacting Issues

Apr 2, 2021 By Inga Weizman In PagerDuty

We are delighted to announce a new Status Dashboard for the Zendesk Customer Service integration. The dashboard enables customer service agents to have real-time visibility into major incidents that are impacting their customers within the Zendesk tool suite, so they can proactively update customers when an incident occurs.

Read Post

PagerDuty

Read more about Enabling Customer Service With Full Visibility Into Customer-Impacting Issues

Strategies to Reduce Alert Fatigue in Your SOC Team

Apr 2, 2021 By Ritika Bramhe In OnPage

In a SOC (security operations center), alerts originating from hundreds of systems compete to get attention. What ensues is a security analyst’s battle to beat alert fatigue while effectively defending their organization from cybersecurity threats. Alert fatigue is a major challenge faced by security operations center (SOC) teams. The stakes are even higher since they take on the enormous responsibility of maintaining networks and data systems.

Read Post

OnPage

Read more about Strategies to Reduce Alert Fatigue in Your SOC Team

How to configure services in Squadcast: Best practices to reduce MTTR

Mar 31, 2021 By Biju Chacko In Squadcast

With a rise in digital platforms, IT infrastructure has grown exponentially complex to a level where multiple application interdependencies coexist with varied architecture & oncall team types. This blog looks at how you can model your infrastructure in Squadcast to reduce your time to respond & resolve incidents.

Read Post