Operations | Monitoring | ITSM | DevOps | Cloud

Mastering Service Configuration in Icinga Director

The Icinga Director configuration tool makes it easy to define monitoring objects through the web UI and deploy them to the Icinga 2 API. In this blog post, I’ll walk you through how to configure services in Icinga Director. If you haven’t used Icinga Director yet, take a look at our introduction. I assume that most of you are already familiar with Icinga 2 and have used the DSL to define objects.

Automated Diagnostics & Triage: The Fastest Way to Cut Incident Time

Too many incidents waste valuable engineering time on the basics: collecting logs, pulling system data, and tracking down the right person to fix the issue. Meanwhile, customers experience delays, SLAs are breached, and critical work gets pushed aside. The real kicker? Those L3 and L4 severity incidents that could actually prevent future fires get labeled as “nice to have” and collect dust in your backlog. Automated diagnostics and triage eliminates these bottlenecks.

The PagerDuty Vision for AI-First Operations

Something fundamental needs to change in the way we run operations. Organizations are deploying AI to optimize everything from coding and deployment to resource planning and incident management. But they’re discovering that managing AI-powered systems requires a completely different operational mindset. AI models hallucinate. Data pipelines degrade silently. Algorithms develop bias without warning.

Fiber Paths and Failsafes: Why Your Network Design Matters

Redundancy isn’t just a buzzword – it’s the design principle keeping modern AI and cloud applications online. In this Uplink episode, Kevin Schlosser, Interconnection Product Manager at NTT Global Data Centers, explains how resilient infrastructure is engineered to expect failure but remain operational. We explore: Diverse entry points and fiber path management AI-driven bandwidth growth: 100G standard, 400G emerging Cooling innovations for intense compute workloads Why providers without their own fiber may offer the most resilient paths.

LTS vs. upgrades: which future are you building for?

How should businesses decide between sticking to an LTS release or moving to a continuous upgrade model? In this episode, we explore the trade-offs, from stability and security to innovation and agility, and why flexibility in your upgrade policy is key to long-term success. We break down when LTS makes sense, when frequent upgrades deliver the most value, and how to balance both to keep your business secure, stable, and ready for what’s next.

Early Warning Signals: Now in Microsoft Teams

As promised, we’re continuing to expand our Early Warning Signals coverage. In addition to our recent integrations for Slack, SMS, and Webhooks, we’re excited to announce that Early Warning Signals now works in Microsoft Teams. This is another step toward making early outage alerts accessible wherever your conversations happen.

REST easy with REST Packs

The countdown to CriblCon 25 is on and we’re giving you an exclusive first look at the expert insights, innovative solutions, and success stories you’ll see on the big stage. REST collector configuration can be painful, requiring navigating to multiple screens and importing multiple configuration files, but it’s about to get a lot easier. Join Cribl experts to preview how easily you can install and build new packs with new enhancements.