The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Modern ITSM Solutions: Flexibility in Incident Response

We no longer live in a world where a few tools determine the way organizations structure their processes. From IT Service Delivery to Incident Response, Modern IT Operations Solutions need to embody the flexibility that most Enterprises require. The dynamic ITOps ecosystem has shifted to put choice back in the hands of the user. Now, IT Solutions must follow suit. Modern Incident Response platforms, in particular, need to be flexible enough to mirror the architecture of the enterprises that use them.

Advice for On-call Teams During COVID-19

I’ve offered up some tips for folks who are on call during the COVID-19 crisis, but I thought it would be helpful to get some more ideas from people with different perspectives. So I reached out to some people I trust to see what they had to say. They all have different viewpoints, but some themes emerge, like managing alerts, having empathy, and practicing self-care. The participants, in alphabetical order: Aaron Aldrich is a Developer Advocate at LaunchDarkly, with a focus on DevOps.

Remove Manual Bottlenecks in DevOps with AIOps

DevOps pipelines generate massive amounts of data. To maintain the stability and speed of application delivery, operations leaders must analyze it quickly and continuously. But how can they keep DevOps — and their business — agile? Gartner’s “Augment Decision Making in DevOps Using AI Techniques” provides, in our view, the answer for operations leaders to make precise data-driven decisions and automate actions for rapid application delivery.

Optimizing your alerts to reduce Alert Noise

Reducing alert fatigue starts with your monitoring platform: set the right thresholds to trigger alerts, and work out which of those alerts are essential enough to send to your on-call platform. This post outlines some of the best practices that help you reduce alert noise and improve your on-call experience. The word noise implies something unpleasant and unwanted. Combine that with on-call and it adds a factor of annoyance to an already overwhelming process.

Challenges Faced by MSPs in Light of COVID-19

The COVID-19 crisis has proven to be a challenging time for IT support teams and managed service providers (MSPs). It has left these organizations not only in a vulnerable position, but also uncertain about what may be in store for them. OnPage interacts with current and prospective clients ranging from large businesses to small and medium enterprises (SMEs).

Virtualize the NOC: Accelerate Your Transition to Remote IT Ops with AIOps

The sudden shift to remote work caused by the global pandemic has forced IT Ops pros to quickly adjust in multiple ways to maintain the uptime and stability of critical digital services. Amidst this crisis, AIOps has emerged as a lifeline, as it facilitates remote collaboration, streamlines incident management, and accelerates detection and resolution.

How PagerDuty's Ecosystem Partners Are Helping People During the COVID-19 Crisis

For many of us, “working” is incredibly difficult right now. That’s true at the organizational level, where maintaining business continuity and accounting for changes in customer needs are even more critical. But it’s also true at the individual level, where the sudden shift to working from home has jolted us all into working in new ways, and made virtual collaboration an essential part of each workday.