
Resolve's Agents of IT podcast - Ep. 17 - Agentic Workflows to Performance Intelligence

In this episode of Agents of IT, Ari Stowe sits down with Geoff McQueen, four-time founder and CEO of Ascendius, to unpack what it takes to navigate AI-driven disruption. Geoff shares a clear framework for where automation is headed, from individual AI use to agent-driven workflows to AI embedded across the business. Most organizations are still early. The real opportunity is in making AI work at the business level.

From Complex to Simple: How Integrated GRC Transforms Compliance, Risk and ITSM Operations

Your teams face a complex regulatory landscape, limited visibility across departments and the need to demonstrate audit readiness and risk accountability. But you’re not alone; over half of global risk leaders say regulatory complexity is their biggest headache, while many struggle with siloed vendor data, fragmented controls and manual GRC processes.

Pyroscope Community Call LIVE from GrafanaCON 2026

Join us live from GrafanaCON 2026 for the Pyroscope Community Call! We’re kicking things off with a look at everything happening in the Pyroscope ecosystem, alongside special guest Alberto Soto. In this session, we look back over the last year in Pyroscope, cover what’s new in continuous profiling, and preview what’s coming next. From multi-language source code integration and symbolization improvements to OpenTelemetry profiles and performance gains, Pyroscope has evolved rapidly over the past year.

Loki Community Call LIVE from GrafanaCON 2026

Join us live from GrafanaCON 2026 for the Loki Community Call! We’re kicking things off with a look at everything happening in the Loki ecosystem, alongside special guests Poyzan Taneli, Ben Clive, and Trevor Whitney. In this session, we look back over the last year in Loki, explore the brand new “Thor” architecture, and dive into what’s coming next for logging at scale. From a completely new columnar storage format and Kafka-based ingestion to a redesigned query engine and improved support for high-cardinality data, Loki is evolving to meet the demands of modern logging.

Alloy, OpenTelemetry & Instrumentation Community Call LIVE from GrafanaCON 2026

Join us live from GrafanaCON 2026 for the Alloy, OpenTelemetry & Instrumentation Community Call! We’re kicking things off with a look at everything happening across Alloy and the OpenTelemetry ecosystem, alongside special guests Ted Young, Mischa Thompson, and Liudmila Molkova. In this session, we look back at Alloy’s rapid growth and adoption, explore the introduction of the new OpenTelemetry Engine, and dive into fleet management, instrumentation, and onboarding at scale.

Release v2.10: Secrets Management, Nagios Plugin Collector, Azure Monitor, and more

What’s new in Netdata v2.10: this release brings powerful new capabilities to help you monitor, troubleshoot, and understand your infrastructure faster without complexity. In this video, we walk through the key updates: Secrets Management (securely manage sensitive configuration data), Nagios Plugins Collector (extend monitoring using existing Nagios plugins), and Azure Monitor (bring Azure metrics into Netdata for unified visibility).

What Does Load Testing Measure? (Top 5 Performance Metrics) | Resilience Testing | Harness

Before you deploy, you need to know if your application can handle real-world traffic. In this video, we break down the 5 essential load testing metrics: Response Time (latency), Throughput (requests per second), Error Rates (system stability), Resource Utilization (CPU/Memory bottlenecks), and User Concurrency. Whether you're into Software Engineering, DevOps, or SRE, understanding these System Design fundamentals is the only way to prevent server crashes and ensure Software Scalability.
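
As a rough illustration of three of the metrics named above, here is a minimal Python sketch that summarizes response time, throughput, and error rate from a list of recorded requests. The `RequestSample` record, its field names, and the `summarize` helper are hypothetical and are not taken from the video or from any Harness tooling; resource utilization and user concurrency need data from the system under test and are left out.

```python
import math
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class RequestSample:
    """One recorded request (hypothetical record shape for this sketch)."""
    start_s: float      # when the request started, in seconds
    duration_ms: float  # how long it took, in milliseconds
    ok: bool            # True if the request succeeded


def percentile(sorted_values: List[float], pct: float) -> float:
    """Nearest-rank percentile of an already sorted, non-empty list."""
    if not sorted_values:
        raise ValueError("no samples")
    rank = max(1, math.ceil(pct * len(sorted_values) / 100))
    return sorted_values[rank - 1]


def summarize(samples: List[RequestSample]) -> Dict[str, float]:
    """Compute latency percentiles, throughput, and error rate."""
    if not samples:
        raise ValueError("no samples")
    durations = sorted(s.duration_ms for s in samples)
    first_start = min(s.start_s for s in samples)
    last_end = max(s.start_s + s.duration_ms / 1000 for s in samples)
    window_s = last_end - first_start
    errors = sum(1 for s in samples if not s.ok)
    return {
        "p50_ms": percentile(durations, 50),
        "p95_ms": percentile(durations, 95),
        # Requests completed per second over the observed window.
        "throughput_rps": len(samples) / window_s if window_s > 0 else float("nan"),
        "error_rate": errors / len(samples),
    }
```

Percentiles are used instead of a plain average because a small number of very slow requests can hide behind a healthy-looking mean.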

Chaos Engineering vs. Traditional Testing: What's the Difference? | Resilience Testing | Harness

Stop treating system outages like surprises and start preparing for them. While traditional software testing is the bedrock of development, using unit, integration, and regression tests to verify that code meets specific requirements, it only accounts for what we expect to happen. Chaos Engineering takes a different approach by shifting the focus from bug prevention to system resilience. Instead of asking "does this work?", Chaos Engineering asks "how does this survive?" by injecting real-world turbulence like network latency or pod failures directly into production-like environments.
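
The contrast between "does this work?" and "how does this survive?" can be sketched in a few lines of Python. Everything here is hypothetical and only meant as an illustration: `lookup_price` stands in for a dependency, `FlakyDependency` injects a fixed number of connection failures, and `price_with_retry` is the caller whose resilience is being checked. Real chaos experiments inject faults at the infrastructure level rather than in process.

```python
from typing import Callable, Optional


def lookup_price(item: str) -> float:
    """Hypothetical dependency: returns the price of a known item."""
    return {"widget": 9.99}[item]


class FlakyDependency:
    """Wraps a callable and fails its first `failures` calls (injected fault)."""

    def __init__(self, func: Callable[[str], float], failures: int) -> None:
        self.func = func
        self.remaining_failures = failures
        self.calls = 0

    def __call__(self, item: str) -> float:
        self.calls += 1
        if self.remaining_failures > 0:
            self.remaining_failures -= 1
            raise ConnectionError("injected fault")
        return self.func(item)


def price_with_retry(
    fetch: Callable[[str], float],
    item: str,
    attempts: int,
    fallback: Optional[float],
) -> Optional[float]:
    """Caller under test: retry on connection errors, then fall back."""
    for _ in range(attempts):
        try:
            return fetch(item)
        except ConnectionError:
            continue
    return fallback
```

A traditional test would assert that `lookup_price("widget")` returns the expected value. A chaos-style check wraps the dependency, for example `FlakyDependency(lookup_price, failures=2)`, and asserts that `price_with_retry` still returns a price within three attempts, or degrades to the fallback when the injected outage lasts longer than the retries.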