
Troubleshooting & RCA with Olly

If troubleshooting still feels harder than it should, check two numbers: how many dashboards you have, and how many alerts fire every day. For most teams, the answer is hundreds of dashboards and thousands of alerts, a sign of maturity, coverage, and good intentions. Yet when something actually breaks, that coverage rarely turns into clarity fast enough.

Will humans be replaced by AI? The truth

Agentic AI doesn’t replace analysts; it augments them. The real value comes from making teams more efficient, not smaller. This is the perspective most people miss. About Elastic: Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale. Elastic’s solutions for search, observability, and security are built on the Elastic Search AI Platform, the development platform used by thousands of companies, including more than 50% of the Fortune 500.

How to Generate a New Puppet Module with VS Code and GitHub Copilot

Revolutionize your infrastructure by leveraging AI tooling in the Puppet ecosystem. In this technical demonstration, we explore how to significantly reduce the time required to create new Puppet modules using Visual Studio Code, GitHub Copilot, and the Puppet Model Context Protocol (MCP) server.

Silent Failures: Why AI Code Breaks in Production

You ship a small “safe” change on Friday. The diff is tiny, the tests are green, and the AI assistant was confident. An hour after deploy, your on-call channel lights up. A downstream service is rejecting responses that look fine in code review. Now you’re rolling back and rewriting a fix that should have been obvious if you had real traffic in the loop. This isn’t a hypothetical.

A Step-by-Step Look at how Agentic, Autonomous ITOps Resolves Incidents

Agentic, autonomous ITOps improves incident response by carrying context from detection through resolution, reducing noise, delay, and manual coordination. Most incident responses don’t break down for lack of data. Monitoring systems generate more than enough signals. The problem is that understanding those signals, and deciding what to do with them, happens in fragments. Engineers move between dashboards, logs, tickets, and chat threads, stitching together context by hand.

What Agentic AI Is Really Made Of (Most People Miss This)

Agentic AI isn’t just an LLM. Without the right context, it gives generic answers. This is the component that makes its decisions actually useful.

AI Query Assist for SolarWinds Database Performance Analyzer

Is your database slow? Let AI do the heavy lifting. Watch how SolarWinds DPA’s AI Query Assist transforms query tuning from a manual headache into a streamlined process. This demo shows you how to get instant, AI-powered recommendations for your worst-performing queries while maintaining the control to review and verify every fix. It’s not just about finding the problem—it’s about fixing it faster.

Investigate Issues in Slack: Grafana Cloud Slack App with AI

The Grafana Cloud app for Slack brings observability and incident response closer to where you and your teams already collaborate. Ask questions about system health, alerts, on-call schedules, and Grafana Cloud features; manage incidents and alerts; and collaborate with full context.