How AI Agents Are Redefining the SRE Role

Even the best site reliability engineers (SREs) spend too much time doing reactive work—triaging incidents, gathering context, escalating to the right teams, and documenting what happened. That work is essential, but it’s not where an SRE’s highest value lies. These engineers are hired to build and maintain resilient systems, not play air-traffic control with every alert that hits their queue.

Announcing a forthcoming integration with PagerDuty + Azure AI SRE Agent for faster incident response

The energy at Microsoft Ignite this year was electric. AI was everywhere, and the possibilities are limitless. As developers and operations teams explore what AI can do, one thing became clear: the future isn’t about switching between tools. It’s about intelligent agents working together to help humans solve problems faster. At PagerDuty, we’re building on that excitement.

From Reactive Response to Systemic Resilience: The System That Gets Smarter With Every Incident

Most operations teams are stuck in a reactive loop: resolving incidents as they happen, then moving on to fight the next fire. This approach keeps things running in the short term, but it prevents responders from documenting their learnings in a way that improves overall system resilience. There are practical reasons for this.

Demo Roundups! Building Resilient On-Call Operations for the Holiday Season

The holidays are retailers' make-or-break moment, when every minute of downtime directly impacts revenue and customer experience. Join us for a retail-focused deep dive into building holiday-ready on-call operations that protect your peak season revenue. We'll demonstrate how PagerDuty's new scheduling experience and AI assistance ensure seamless coverage during your busiest, and most critical, time of year.

Five key takeaways from EDUCAUSE 2025: Adopting AI while navigating change

Having just returned from the 2025 EDUCAUSE Annual Conference in Nashville, I want to share some insights on the future of campus IT from the higher education technology leaders in attendance. Every year, this conference provides an opportunity for technology providers and higher ed professionals to connect and explore the latest innovations in higher education technology. Two themes emerged as critical priorities.

Why Agentic AI Adoption Is Accelerating in Europe and What Comes Next

Across Europe, the cautious optimism business leaders held towards AI agents has evolved into more widespread enthusiasm. What was once a curiosity is now core to how many European organizations operate, respond, and innovate. According to PagerDuty’s latest agentic AI survey, three-quarters or more of organizations in France, Germany, and the UK are deploying multiple AI agents. This growing confidence reflects a broader trend.

How to Choose an AI SRE Solution

The AI SRE landscape has exploded over the past year, with vendors racing to add artificial intelligence capabilities to their platforms. For engineering leaders evaluating these solutions, the sheer number of options can feel overwhelming. Some vendors are building AI-native solutions from scratch, while others are retrofitting AI onto existing workflows. Cloud providers are embedding agents into their ecosystems, and observability platforms are adding intelligence layers to their telemetry data.

Unlock Faster Incident Resolution with PagerDuty + Logz.io

Join us live as we demo how PagerDuty and Logz.io work together to supercharge your Root Cause Analysis. See how real-time observability and enriched incident context can help your team detect, triage, and resolve issues in minutes—not hours. Don’t miss this chance to see the integration in action, ask questions, and learn how to keep your teams in sync while driving continuous improvement. Perfect for anyone looking to level up their incident response!

Work Where Your Teams Already Are with PagerDuty's AI Agents for Slack

Modern operations happen in Slack, where teams spend their days collaborating, troubleshooting, and resolving incidents. And while many incident management tools offer Slack-friendly experiences, they lack the end-to-end capabilities that teams need. During critical moments, other tools may require users to switch between Slack and their own interfaces, creating friction.