Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Reliability is not about mythical perfection

See what reliability means to Ganesh Seetharaman, Managing Director at Deloitte, and why it's more than high uptime. Full transcript:  Reliability to me is not about achieving mythical perfection. It's about embracing complexity, recovering quickly from failures or incidents, and building trust through transparency and adaptability.

DevAIOps: A Call To Action For The Heroes Among Us

The year is 2025, and I’ve been watching teams discover what happens when you give developers AI superpowers without giving them AI super-governance. It’s like the merchandising scene from Spaceballs: “Vibe Coding: The Flamethrower. The kids love this one.” But here’s the thing: I’m not here to take away the flamethrowers. I’m here to hand out fire extinguishers and maybe suggest we practice in a safe room instead of the living room.

Getting started with the relaxAI API: Sovereign, cost-effective AI

Earlier this year, we launched relaxAI, an AI assistant designed with one paramount focus: your privacy. We’re now excited to announce the relaxAI API is in General Availability (GA) offering an OpenAI interface. This gives UK organizations up to 90% cost savings versus leading providers while ensuring data never leaves UK jurisdiction.

Introducing the Cortex MCP Server

Cortex gives engineering teams full visibility and control over their services, from ownership and standards to service history and production readiness. Our goal is to help teams stay aligned and move faster so they are ready for whatever is ahead. The reality for any engineering team is that developers spend the most of their time in their IDE, not their IDP. And while developers love the context Cortex provides, they don’t love context switching.

Smarter Insights and Pipeline Control - New in DataStream

We’re constantly improving DataStream to make security data management simpler, smarter, and more efficient for modern SOCs. This latest update introduces new capabilities that bring even more visibility and flexibility to your telemetry pipelines. Let’s take a closer look at what’s new.

New in OTel: Auto-Instrument Your Apps with the OTel Injector

As distributed systems scale, maintaining manual instrumentation across services quickly becomes unsustainable. The OTel Injector addresses this by automatically attaching OpenTelemetry instrumentation to applications, no code changes needed. This blog covers how the OTel Injector works, how it integrates with Linux environments, and how to set it up for consistent telemetry across your stack.

Scaling Online Game Infrastructure for High-Engagement PvM Content

The explosive popularity of player-versus-monster (PvM) content in online games brings significant backend challenges, particularly as titles scale globally. Instanced boss fights, real-time combat logic, and mass player concurrency demand robust, responsive server infrastructure that can scale both horizontally and vertically - without degrading the player experience.

From Wallpaper to Web Servers: How One Immigrant Switched from Walls to DevOps in Just Two Years

He landed in the U.S. with a suitcase, a scraper, and a strong back. No tech degree. No connections. Just a willingness to work and a sense that something bigger might be possible. At first, he did what he knew best - he worked as a wallpaper installer. The job was honest, physical, and surprisingly calming. "There's something meditative about smoothing out bubbles," he says. "But after a while, I realized I wanted to build something that didn't peel off the wall."