The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

When an IT Incident Occurs at Your Company, What TV Show Does It Most Resemble?

If you’re like us, you’ve been binge-watching a lot of shows over the past few months. That got us thinking: does your company's problem resolution process ever feel like an episode of one of your favorite shows? We ran a short poll to see how IT teams would relate their incident resolution processes to popular TV shows. Here are the results.

ITIL Incident Management: Taking a Structured Approach to Incident Resolution

Business continuity has become a key priority for most management teams and their IT associates. Every minute lost to downtime can result in bloated overheads and reduced revenues. That said, no matter how well engineered the network is, some issues and problems will arise in the course of its operations. ITIL broadly defines an incident as an unplanned event that interrupts a service or has the potential to interrupt a service if not addressed immediately.

OnPage Incident Management - Perfect for ITOps, Clinical and Crisis Communication

Consolidate IT alerts onto one platform. Access time-stamped alerts with relevant information. Manage incident responders and stakeholders through secure messaging, live ticket updates and postmortem reporting. Rock-solid reliability. Clinical Communications Platform: Connect healthcare personnel through HIPAA-compliant messaging and alerts. Manage on-call shifts and automate alerts. Real-Time Call Routing connects patients to caregivers.

Amazon CloudWatch Integration

An OnPage high-priority mobile alert is triggered when CloudWatch detects an anomaly. OnPage notifies the right person using alerting policies, routing rules and on-call schedules. The integration minimizes the time it takes to identify and respond to incidents occurring in AWS resources or applications. About OnPage: Organizations large and small are adopting OnPage's intelligent alerting solution, ensuring that encrypted, secure critical incident notifications are NEVER missed and are always delivered to the right person at the right time.

PagerDuty at AWS re:Invent: New Tools to Power AWS and Your Cloud Migration

Leave it to Amazon Web Services to find a way to make their massive celebration of all things cloud entirely virtual, free, and even bigger. Even though we won’t be able to join you all in Las Vegas this year, PagerDuty is very excited to be a Gold sponsor of re:Invent again. Be sure to stop by our sponsor page to see a product demo, get the latest on our newest AWS integrations, grab your swag bag, or participate in one of our fun booth activities.

Masterclass: Advanced series Session 1 - Hack your ServiceDesk Plus for the new normal

Learn a few advanced features of ServiceDesk Plus that enable you to create a virtual office experience for your requesters and technicians. Masterclass+ is a webinar series focussed on training ServiceDesk Plus administrators on advanced features, configurations, and integrations.  

How to SRE without an SRE on your team

Are terms like “error budgets” and “SLOs” roadblocks on your way to adopting SRE practices in your organisation? Our latest blog post, "How to SRE without an SRE on your team", looks at some of the most elementary SRE concepts that you can start implementing right away. We help you pick SLOs, identify toil, and touch on automation for SREs, along with a few best practices to get you started on your SRE journey.

Masterclass: Advanced series session 2 - Build a high velocity incident response tool chain

In this session of the advanced masterclass series, you'll learn how to link ServiceDesk Plus to the ManageEngine operations tool chain and how to operate an analytics-driven service desk. You'll also learn about features that will help you separate management and bureaucracy, enabling you to accelerate your service desk operations.

Masterclass: Advanced series Session 2 - Hack your service desk for the new normal (Cloud)

In this session of the advanced masterclass series, you will discover ways to adapt your service desks to the current crisis and learn how integrations with Microsoft Teams, Jira and Slack work in the cloud version of ServiceDesk Plus.

What's the Difference Between MTTR, MTTD, MTTF, and MTBF?

We’ve all been there. You’re on an important Zoom call with your team, and someone uses an abbreviation you’re not familiar with. You’ve heard it, but you’re not quite sure exactly what it means. You want to do a quick Google, but you’re sharing your screen! Ugh. Let’s pull apart some of these abbreviations for incident management KPIs (Key Performance Indicators). Now, you won’t find yourself SOL on your next Zoom call with the Support team.