Operations | Monitoring | ITSM | DevOps | Cloud

Release v2.10: Secrets Management, Nagios Plugin Collector, Azure Monitor, and more

What’s New in Netdata v2.10 In this release, Netdata brings powerful new capabilities to help you monitor, troubleshoot, and understand your infrastructure faster without complexity. In this video, we walk through the key updates: Secrets Management – Securely manage sensitive configuration data Nagios Plugins Collector – Extend monitoring using existing Nagios plugins Azure Monitor – Bring Azure metrics into Netdata for unified visibility.

Seeing the Bigger Picture: What technical leaders can learn from evolving monitoring needs

A preview of leadership insights shaped by real-world experience Estate-wide clarity for leaders who still need technical depth As data estates grow, the role of technical leaders changes. Visibility becomes harder. Communication becomes more important. Decisions have broader consequences. Many leaders start their careers focused on the technical details.

What Does Load Testing Measure? (Top 5 Performance Metrics) | Resilience Testing | Harness

Before you deploy, you need to know if your application can handle real-world traffic. In this video, we break down the 5 essential load testing metrics: Response Time (latency), Throughput (requests per second), Error Rates (system stability), Resource Utilization (CPU/Memory bottlenecks), and User Concurrency. Whether you're into Software Engineering, DevOps, or SRE, understanding these System Design fundamentals is the only way to prevent server crashes and ensure Software Scalability.

What's New in VictoriaMetrics Cloud Q1 2026? Logs, MCP Server, Better Alerting, and... a Secret Project

Q1 2026 has been one of our most eventful quarters yet for VictoriaMetrics Cloud. We shipped something we have been building towards for a long time, crossed a few infrastructure milestones, and started clearing the path for what is coming next to the most performant observability stack.

Chaos Engineering vs. Traditional Testing: What's the Difference? | Resilience Testing | Harness

Stop treating system outages like surprises and start preparing for them. While traditional software testing is the bedrock of development, using unit, integration, and regression tests to verify that code meets specific requirements, it only accounts for what we expect to happen. Chaos Engineering takes a different approach by shifting the focus from bug prevention to system resilience. Instead of asking "does this work?", Chaos Engineering asks "how does this survive?" by injecting real-world turbulence like network latency or pod failures directly into production-like environments.

The Modern Messaging Primer: Navigating the Shift from Legacy Middleware to Open Source Innovation

The shift from legacy middleware to open-source innovation promises agility and cost savings, but introduces the 'Modernization Tax'—operational complexity that requires new approaches to observability, governance, and management across hybrid messaging environments.

KubeVirt Networking: How to Preserve VM IP Addresses During Migration

Organisations are re-evaluating their VM infrastructure. The economics have shifted, the tooling has matured, and the case for running two separate platforms, one for containers, one for VMs, is getting harder to justify. Platform teams that spent years managing hypervisor infrastructure are being asked to consolidate, and most are landing on the same answer: Kubernetes. KubeVirt makes running VMs on Kubernetes possible.