Operations | Monitoring | ITSM | DevOps | Cloud

Insights Agent: Deep operational intelligence where your team works

This blog post is part of PagerDuty’s ongoing series on how we’re helping customers navigate their journey towards autonomous operations. Read on to learn about how PagerDuty Advance Insights Agent (now Generally Available for Microsoft Teams users) builds towards this vision. As AI accelerates development and teams ship more code than ever, operational data is everywhere; insights aren’t.

Scribe Agent updates: no more manual note-taking or lost context

This blog post is part of PagerDuty’s ongoing series on how we’re helping customers navigate their journey towards autonomous operations. Read on to learn about how PagerDuty Advance Scribe Agent updates (Generally Available) build towards this vision. When a major operational issue hits, there’s always someone drawing the short straw to take on the most thankless job in incident response: scribing the call. Chances are you were already that someone.

PagerDuty Appoints John DiLullo as Chief Executive Officer

Jennifer Tejada Transitions to Executive Chair of Board of Directors After Serving as CEO Since 2016. John DiLullo Brings Deep Enterprise, Product and Go-to-Market Leadership Experience to Lead Next Phase of Growth. Company Reaffirms First Quarter and Full Fiscal Year 2027 Guidance.

PagerDuty's Slack App: New Incident Management Capabilities

We’ll be rolling out new Slack capabilities to eliminate more manual toil from your incident workflow: click once to promote any alert to an incident, get dedicated channels created automatically, page responders without leaving Slack, and manage all your settings in one place. This is part of our path to autonomous operations: reducing toil, protecting your capacity, and letting you stay in flow. If you’re only using PagerDuty for on-call scheduling, you’re missing the full picture.

New enhancements to PagerDuty's SRE Agent: triage faster without waking a human

AI promise and AI capabilities often diverge, with developers often reporting much faster code production, but not enough change in how incidents are handled. When the rate of change is faster than ever, but the rate of recovery from incidents isn’t moving, developers wind up stuck in firefighting mode. And, when these systems fail, it’s costly. According to PagerDuty’s State of AI-First Operations, over a third of surveyed companies report losing $500K per hour of downtime.

Introducing Shift-Based Schedules: Smarter, Faster, and Easier for Any Team

This blog post is part of PagerDuty’s ongoing series on how we’re helping customers navigate their journey towards autonomous operations. Read on to learn about how PagerDuty’s Shift-Based Schedules (planned GA in May) builds towards this vision. PagerDuty has long been the gold standard for on-call management, helping thousands of teams build the foundations of digital reliability.

Activate Your Continuous Learning Flywheel With Post-Incident Reviews in PagerDuty UI

Earlier this year at our H1 2026 launch, we announced PagerDuty’s vision for autonomous operations: a future where AI agents learn from every incident, prevent failures before they happen, and progressively automate so teams can focus on innovation instead of firefighting.

Why Dedicated Incident Channels are the Modern Standard for Slack-Based Incident Response

Where do your teams go during a critical incident? For distributed teams, that war room is a channel in Slack or Microsoft Teams. The question is: are you creating a dedicated space for each incident, or are responders scrambling across DMs, email threads, and general channels trying to piece together what happened? The answer matters. Using dedicated incident channels has become the industry standard for high-performing incident response teams.

Prevent outages with PagerDuty incident retrospectives

Recurring incidents are a symptom of a broken process. Your teams are working hard to get services back online, but constantly battling the same problems is frustrating and not a sustainable approach. What’s reflected here is not a failure in engineering abilities, but a deficiency in the learning that should follow an incident. When incident analysis focuses on finding a single person or team to blame, it creates a culture of fear.

How to use an SRE agent to reduce downtime

An alert in the middle of the night warns of a potential business failure. Manual incident response becomes more complex due to the overwhelming data from distributed and dynamic digital services. With an SRE agent, your engineering team can cut through alert clutter. They can sort through various signals quicker, decreasing burnout and achieving faster, more affordable resolutions. Operational resilience will see its next evolution with Agentic AI.