Incident Management

The latest news and information on incident management, on-call, incident response, and related technologies.

Shifting left on incident management

In the fast-paced world of software development and product delivery, incidents are often viewed as unwanted disruptions. Traditionally, incident management might be triggered only for critical issues, such as complete system outages, data loss, or security vulnerabilities. You don’t need to go back far to find a few that were very serious: Heartbleed, xz utils, and more.

incident.io On-demand: On-call as it should be, present and future

Since the inception of incident.io, we have set out to build the single destination companies turn to when things go wrong. With the release of On-call, we’ve achieved just that. From waking your team up at 2am to gleaning insights from incidents, we’ve got you covered. From our sleek, intuitive mobile app to customizable workflows, incident.io is built for the way modern teams actually work, featuring a robust platform of Response, On-call, and Status Pages.

#6 Virtual Meetup: PagerDuty Session: James Pickles (Solutions Consultant @ PagerDuty).

Elevate your biz & enhance your automation skills! Get together with the Rundeck by PagerDuty Process Automation crew and learn how automation is leading the way to innovation and fast-tracking business for the future! 🚀 Hear automation success stories from Diego Infiesta (IT Infrastructure Manager @ Ryanair) & Hans Erasmus (Director @ HBPS Consulting), and dive into the world of open-source automation with James Pickles (Solutions Consultant @ PagerDuty).

Introducing Squadcast and ServiceNow Bidirectional Integration For Enhanced Operational Efficiency

Discover everything about the powerful bidirectional integration between Squadcast and ServiceNow, including its key features and benefits, designed to streamline incident resolution and enhance collaboration within your DevOps and IT teams. Key takeaways:
Accelerate Incident Response: Streamline incident response and accelerate resolution directly through Squadcast and ServiceNow.
Enhanced Learning and Retrospectives: Simplify tracking, retrospectives, and learning for your engineering team, ensuring a more efficient and productive incident management process.

How Incidents Foster Leadership

To become battle-tested, you need to go through battles, not just read books or mentor newcomers. Both are helpful, but the stakes are low. On the other hand, high-stakes jobs, such as running a big project or managing a team, are hard to get when you lack experience. So how can we solve this dilemma? Enter incident response.

The Challenges of Rising MTTR - And What to Do

Data volumes are soaring. Environments are increasingly intricate. The risk of applications and systems encountering breakdowns is sky-high, and the mean time to recovery (MTTR) for production incidents is moving in the wrong direction. Disruptions not only jeopardize critical infrastructure but also have a direct impact on the bottom line of organizations. Swift recovery of affected services becomes paramount, as it directly correlates with business continuity and resilience.