Incident Management

The latest news and information on Incident Management, On-Call, Incident Response and related technologies.

Toil: Still Plaguing Engineering Teams

Dec 6, 2022 By Damon Edwards In PagerDuty

Our industry has always had localized expressions for work that was necessary but didn’t move the company forward. The SRE movement calls this type of work “toil.” The concept of toil is a unifying force because it provides an impartial framework for identifying — then containing — the work that takes up our time, blocks people from fulfilling their engineering potential, and doesn’t move the company forward.

Cyber, incident, downtime: Three words that chill the board, and how to tame them

Dec 2, 2022 By PagerDuty In PagerDuty

Three words, strung together, strike fear into everyone around a boardroom table: "Cyber... incident... downtime". They are never the precursor to a good meeting! Technology incidents can leave the business in the dark and bring the wheels of industry grinding to a halt. A Gartner report found that companies left without operational systems can lose up to half a million dollars per hour from severe incidents, based on losses and remediation.

DERDACK SIGNL4 for Microsoft Sentinel, Defender for Cloud and more

Dec 2, 2022 By SIGNL4 In SIGNL4

Doreen talks us through the value-add of SIGNL4 for MSPs and enterprise customers of Microsoft Security products and how SIGNL4 facilitates an automated and seamless 24/7 on-call management experience. Derdack SIGNL4 is a member of the Microsoft Intelligent Security Association (MISA).

PagerDuty Operations Cloud Delivers Process Automation on AWS, Delivering Rapid Return on Investment and Better Customer Experience

Dec 1, 2022 By PagerDuty In PagerDuty

Automated Diagnostics for AWS Customers Reduces Manual Work, Improves Resiliency, Enables Consolidation on PagerDuty.

PagerDuty Incident Workflows for Automated Incident Response Demo

Dec 1, 2022 By PagerDuty In PagerDuty

Leverage Incident Workflows to automate your incident response process. Enjoy a demo of a use case that introduces how to standardize major incident workflows across all P1 and P2 incidents.

How to Help Teams Create Optimal Infrastructure for Availability

Nov 30, 2022 By Richard Whitehead In Moogsoft

Teams are locked into a cycle of suffering characterized by the feeling that they are sprinting just to stay still. This morale- and productivity-destroying state is caused by an inability to find time to save time. Our new research, The State of Availability Report 2022, discovered that teams know what they want to do (harness cloud and DevOps practices and tools to advance digital transformation), but something’s getting in the way.

Improving Incident Management with Automation

Nov 30, 2022 By xMatters In xMatters

Incident management is your organization’s first line of defense. When incidents occur, internal teams must be ready to respond quickly. While incidents can happen anytime, it’s unrealistic to expect incident managers to be prepared to perform manual root cause analysis. Manually monitoring and analyzing applications on multiple servers is extremely difficult, which is why human reaction times have traditionally limited the speed of incident management.

What's New: Updates to Incident Response, PagerDuty Process Automation Software & PagerDuty Runbook Automation, Mobile App Experience, and More!

Nov 30, 2022 By Vera Chan In PagerDuty

We’re excited to announce a new set of updates and enhancements to the PagerDuty Operations Cloud in addition to the November Product Launch announcements made earlier this month. Recent development and app updates from the product team include Incident Response, PagerDuty® Process Automation, the PagerDuty Mobile App, Integrations, as well as Community & Advocacy Events updates.

7 Incident Management Best Practices to Improve Business Efficiency

Nov 29, 2022 By Brad Saville In Exoprise

Think about the last time your IT systems had an outage: How did your team react to it? Were they organized with a clear idea of how best to resolve the issue? Or was it chaotic, with people firing questions from all directions and customer service channels ablaze with requests for help? Digital technology disruptions are typical (and even expected) in the workplace, but they don’t have to be chaotic, with teams rushing around to extinguish the metaphorical fire.

For True Observability, Look Beyond Metrics, Logs & Traces

Nov 29, 2022 By Interlink In Interlink

Achieving full, 360-degree observability across your entire IT ecosystem and application components can be thwarted by the disconnect between technical monitoring and business outcomes, which runs the risk of catastrophic service failures.
