Operations | Monitoring | ITSM | DevOps | Cloud

Demo Roundups! What's New in Schedules: Flexible Shifts + AI Conflict Resolution

Manual scheduling and on-call gaps cost your team sleep and sanity. Join us for a demo of PagerDuty's latest schedule experience improvements. From iCal-compatible shift management to AI-powered conflict resolution, see firsthand how to build bulletproof on-call coverage with minimal operational overhead.

C15 Roadmap & Release 22

We’re excited to launch Release 22—our most advanced update yet. It delivers smarter controls, deeper customization, and long-term reliability. Key improvements include enhanced handling of TTY messages with Wireshark support, flexible call history recording, new Stir/Shaken override options for better traceability, and real-time call limit tracking with an upgraded interface. Plus, starting March 25, 2026, SIP code 603+ will notify callers when calls are blocked due to analytics, in line with FCC regulations.

Optimizing GPU Efficiency and AI Costs with Pepperdata

As AI workloads explode, platform owners face an increasingly common challenge: a massive gap between GPU demand and supply. Pending workloads, idle GPUs, and rising costs make it harder than ever to scale AI efficiently. In this video, we explore how Pepperdata.ai helps enterprises regain control of their GPU environments with two breakthrough solutions: Demand Optimization – Get granular visibility into GPU usage across your entire infrastructure. Identify inefficiencies, balance supply and demand, and uncover hidden capacity.

Scale Chaos Engineering with Automation and AI

Chaos Engineering and Fault Injection testing have been proven to prevent outages, increase availability, and help companies avoid costly downtime. But without the right processes or tools, they require specialized knowledge, a deep understanding of systems, and manual effort for every test. To fully realize the benefits of Chaos Engineering, testing needs to be adopted across all engineering teams without causing a lift or investment that takes away from roadmap progress.

Single-Cloud Dependency Is a Disaster Waiting to Happen

The impact of the AWS outage has reminded many businesses of the risk for businesses that rely heavily on centralised cloud infrastructure, especially when so many essential services are concentrated in a single region. But at the wider industry level, this is also a warning around the widespread lack of contingency planning for cloud failures. Reactive response must give way to strategically planned disaster recovery protocols that engender a resilient cloud market.

Get organized, actionable insights from complex test environments with Datadog Test Suites

Modern teams often run hundreds of synthetic tests across multiple services, environments, and user journeys. While these tests provide deep visibility, managing them as a flat list can quickly become overwhelming, especially as organizations scale and teams specialize.

Top 11 Ruby APM Tools for 2025: A Performance-Driven Selection

Observability has become a core part of running Ruby applications at scale. Knowing how your app performs — from request latency to background job execution — helps catch slowdowns early and improve reliability. This blog walks through some of the most useful APM tools for Ruby in 2025. Each section highlights what the tool does well, where it fits best, and what kind of visibility it brings to your application's performance.