Operations | Monitoring | ITSM | DevOps | Cloud

Detecting an AWS Outage and DR Lessons

A few weeks ago, on 20th October 2025, AWS suffered a widespread outage in its US-EAST-1 region that affected a large number of customers globally. More than 1,000 apps and websites were impacted including major banks and popular games, streaming and social platforms such as WhatsApp, Snapchat, Fortnite and Pokémon Go.

Data Center Vacancy Rates at an All Time Low: What Can You Do?

Data center vacancy rates in North America have hit record lows, with reports from CBRE and JLL indicating figures between 1.6% and 2.3% as of mid-2025. This is driven by exceptionally high demand from hyperscale and AI users, which is outstripping supply and leading to significant competition for space and power. The tight market is expected to continue through at least 2027, with preleasing of new construction at high levels.

CloudZero: Making Kubernetes Costs Transparent And Actionable

Kubernetes is now the backbone of modern software infrastructure, helping teams deploy, scale, and manage applications efficiently across clouds. But when it comes to understanding costs, Kubernetes remains opaque. Teams often can’t answer basic questions like: How do you solve the gap between engineering usage and financial visibility? CloudZero’s new Kubernetes capabilities are built to address this challenge.

From Crashes to Clarity: What's New in Percepio Detect 2025.2

Think of Percepio Detect as a security camera for your firmware—always monitoring, but only storing data when something unusual happens, such as crashes or performance anomalies. By providing rich debugging information when needed while keeping the overall data volume to a minimum, Detect enables continuous observability over unlimited time, even on resource-constrained devices such as 32-bit microcontrollers.

The four pillars holding up your digital business, and what happens when they crumble

When we published the first Internet Resilience Report in 2024, the world was still reeling from the CrowdStrike outage that left airlines grounded and financial institutions scrambling. A year later, the stakes are even higher. The 2025 edition confirms what many of us already feel every day in IT Operations: resilience is no longer about uptime alone. It’s about protecting revenue, customer trust, and digital performance at scale.

Building dbRosetta Using AI: Part 3, Creating a Database

The AI said I had to do a database first, not code. Who am I to argue? So, with all the prompts outlining the goals of the project, I’ve gone forward with the project, and step one is creating a PostgreSQL database on Azure. This is part three of a multi-part set of articles. I’ll move this list to the bottom of future articles: Part 1: Introducing the Concept of dbRosetta Part 2: Defining the Project & Prompt Templates.

What Happens When You Mix AI With Docker?

Discover how Docker is empowering developers in the GenAI era with tools that simplify AI application development. Docker VP of Product Michael Donovan shares how containers are critical for building, testing, and scaling GenAI applications, plus real solutions for the biggest challenges developers face today.

The Architecture of Automation: Why IT Doesn't Lie

Let’s start with something most people get wrong. Automation isn’t magic. It’s math. It does exactly what it’s told. Nothing more, nothing less. Every action, every response, every output is a reflection of truth in motion. And that’s where value actually begins. Most organizations still treat automation like a shortcut: a way to go faster, to handle more alerts, to “keep up.” But speed isn’t the value. Truth is.

Rollbar + Vercel built for how you ship

Vercel helps you ship fast. We help you ship safe with code‑first observability that connects errors to the code and deploys behind them. Together you get speed with clear insight into what is running in production. Today we’re launching our native integration in Vercel’s Observability category so you can connect Rollbar to your Vercel projects in minutes, map environments cleanly, and track deployments from day one.