The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

Demo Roundups! Scaled Service Ownership

Are your teams grappling with tool sprawl, fragmented incident management processes, and rising operational complexity? Join us for an in-depth demo of PagerDuty Operations Cloud, where we'll show you how to overcome these challenges through Scaled Service Ownership. Level up your digital operations expertise with PagerDuty Demo Roundups — a series of live, interactive webinars where you can deepen your knowledge in the Operations Cloud and see how PagerDuty can work for you.

What is DORA and how will it affect me?

The Digital Finance Strategy is a European directive that aims to support and develop digital finance in Europe while maintaining financial stability and consumer protection. The package has three main components, one of which is the Digital Operational Resilience Act (DORA). In this blog post, we’ll attempt to summarize the 113-page DORA proposal, highlighting how it will apply to incident management at financial entities. Side note: we also wrote a blog post about the other DORA, also known as DevOps Research and Assessment.

Transform ITOps and incident management with AI copilots

There are many ways to apply generative AI to modernize IT operations. Advances in GenAI have paved the way for the development of AI-powered ITOps copilots, which have the potential to transform IT operations. AI copilots offer many benefits for IT, including improved decision-making, accelerated incident management timelines, and optimized workflows.

Top 5 IT outages detected by StatusGator

StatusGator is the world’s best status page aggregator: we aggregate the status of thousands of cloud services and hosted applications from their official status pages. But everyone knows official status pages often lag behind, and in those critical moments before the status page is updated, you might be thinking, “Is it just me? Or is it really down?” StatusGator’s Early Warning Signals solves that by alerting you before providers even acknowledge the incident.

G2: Squadcast Leads in Incident Management and Secures Key Wins Across IT Alerting

We’re thrilled to share that Squadcast has been recognized as a Leader for the second time in the Incident Management Category. This win celebrates our pioneering role in Unified Incident Management, where we bring together On-Call Management, Incident Response, Workflow Automation, AI/ML-powered Noise Reduction, and SLO tracking—all in one platform.

Best Practices for Choosing a Status Page Provider

Downtime is inevitable, but what sets successful businesses apart is how they handle it. A key part of incident management is incident communication with both internal and external stakeholders. A status page is a crucial tool for maintaining clear communication with users during outages or service interruptions. Numerous status page providers are available, each with different features. This article will guide you through best practices for selecting a provider that suits your needs.

Mastering regulatory compliance with incident.io

The origin of incident.io goes back to our days building Monzo, a UK-based bank, where Stephen, Pete, and I first crossed paths. As a bank, compliance with numerous regulations was, unsurprisingly, a top priority. When it came to incident management—something we were very involved in—this meant that every aspect of reporting, policy adherence, and root cause analysis (or "contributing factors," as we called it) had to be managed consistently and meticulously.

Demo Roundups! Operations Center Modernization

Solutions Consultants Nick Gallegos and Gurinder Singh show how the PagerDuty Operations Cloud addresses key challenges through Operations Center Modernization. Discover how it unifies your IT operations stack across Security, Network, and DevOps centers, automates remediation, and eliminates the need for a dedicated NOC by serving as a virtual operations center for distributed teams.

Update October 2024 - AI-based summary of alarm details and comprehensive audit logs

Our October update brings you AI-based summaries of alarm details, which make complex or technical content much easier to understand in a matter of seconds. There is also a new comprehensive audit log that records every change made to the system in a traceable way. As always, you can find all the details in this blog article.