Why Local Experts Are the Smart Choice for Portable Split Air Conditioner Setup and Advice

As more homes and businesses in the UK look for flexible ways to stay cool during the warmer months, portable split air conditioners have grown in popularity. Offering stronger cooling performance than standard portable units, they provide a reliable solution without the need for full-scale permanent installation. But to get the most out of these systems, it pays to work with a local expert who understands the equipment - and your property.

Scaling Observability: How We Designed Bindplane to Manage 1,000,000 OpenTelemetry Collectors

Join the live stream at 11 am ET. Platform teams tend to start with just one or, in some cases, a handful of OpenTelemetry (OTel) Collectors, usually running in gateway mode. They then embrace the benefits of a vendor-neutral, standardized telemetry collector for unified logs, metrics, and traces.

Reimagining the Data Centre: Respect, Resources, and the Path to a Sustainable Digital Future

When most people picture a data centre, they might think of a bland industrial shell—cold, humming, and forgettable. But look closer, and you’ll find something far more profound. A data centre is a heavy, complex, and finely tuned space where an immense volume of information flows, transforms, and creates the digital world we rely on. In many ways, it feels like magic. And yet, so does nature. Both deserve respect.

Why Does Your Network Get Blamed When Trouble Lies Beyond the Firewall?

The familiar scene unfolds: Critical applications are sluggish, user complaints are mounting, and the IT war room is buzzing. Eyes quickly dart towards the network team. It’s an almost instinctual reaction. But what happens when the problem isn't within the corporate LAN or even the data center? What if the real culprit lurks somewhere in the vast, untamed wilderness of the internet, a cloud provider's backbone, or a third-party SaaS application’s infrastructure?

Deploy Istio at Scale With Rancher

Managing and deploying applications across multiple Kubernetes clusters presents significant challenges, especially as the number of clusters grows. Traditional methods, like manually applying Helm charts or manifests per cluster, become cumbersome and error-prone, and they make it difficult to scale or to maintain consistency for Day 2 operations. While Rancher allows managing Helm chart repositories and apps, this is done on a per-cluster basis via the UI.

Community Vigilance, Enterprise Response: Addressing CVE-2024-21626 in Rancher

In backend engineering, many days follow a familiar rhythm: coffee, code reviews, maybe deploying a new feature. But occasionally, the routine is interrupted by a message that signals a different kind of challenge, like a Slack notification from the security team: “Hey, we’ve identified a potential issue. Need to sync up.” This post details one such instance—our journey addressing CVE-2024-21626, a privilege escalation vulnerability reported in Rancher.

Optimize Every Second of Every Fulfillment Workflow with Up to 30% Fewer Robots

The scarcity of labor resources and the high costs of robotics ownership are pushing fulfillment companies to rethink their automation strategies. Zebra addressed these challenges head-on in a recent Making the Case report published in collaboration with Modern Materials Handling. This report delves into the top challenges fulfillment operations are dealing with on the robotics front today.

SentinelOne Outage: Why Early Detection and Independent Monitoring Matter

When SentinelOne, a leader in cybersecurity and endpoint protection, experienced a major outage last week, thousands of organizations were suddenly left in the dark. With SentinelOne down for hours, IT and security teams scrambled for information and updates. But there was a critical missing piece: SentinelOne has no public status page. This gap left customers frustrated, searching for answers on social media, Reddit, and unofficial channels.

Engineering Time is Your Most Valuable Asset: Are You Spending It Right?

Technology leaders often face a tempting proposition from their engineering teams: “We could build this ourselves.” It’s a natural instinct, especially when discussing incident management systems. Your team’s confidence isn’t misplaced – they absolutely could build a basic alerting system. However, the question isn’t about capability; it’s about strategic resource allocation and long-term operational excellence.