Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Optimize Kubernetes cluster cost with Datadog Cluster Autoscaler

Running Kubernetes at scale almost always means paying for more compute than you need. To protect reliability, platform and application teams typically overprovision nodes early in development and keep scaling up as they add features and workloads. They are often reluctant to move to smaller or different instance types without a clear picture of how those changes will affect performance or availability. The result is a fleet of underutilized nodes that silently inflate your cloud bill.

Cortex and Rootly partner to help teams turn incidents into continuous improvement

For many engineering teams, an incident is a chaotic, all-hands-on-deck event. Once the incident is resolved, everyone returns to their regular work and the valuable lessons from the incident are often lost. The result is a cycle of repeated failures and engineer burnout, where incidents are something to be survived, not learned from. At Cortex, our mission is to help engineering organizations build a culture of continuous improvement.

Reliability at Scale: A Conversation with DevOps Leader Ivan Battimiello

For more than a decade, Ivan Battimiello has been building and scaling distributed engineering systems across Europe and the United States. With experience ranging from game development to full-stack engineering and DevOps leadership, he has led operational transformations for global teams, implemented modern reliability frameworks, and introduced advanced automation practices that dramatically reduced system failures.

What's Next for NaaS? Top Trends for 2026

Learn how private connectivity, regional hubs, and AI-driven automation are defining the next evolution of enterprise networking in 2026. 2026 is shaping up to be a big year for networking. We’re moving past the ideas of being simply connected – now, networks are becoming intelligent. As we see our customers lean into AI, multicloud, and automation in every corner of their operations, the way they connect everything is changing just as fast.

KubeCon NA 2025: Universal Mesh, federation, and the end of the "mesh tax"

At KubeCon, we asked a simple question at our booth: "How much is your service mesh costing you?" The answers were eye-opening. Engineers shared stories of 40% resource overhead, multi-second latency spikes during peak traffic, and infrastructure bills that had nearly doubled since mesh adoption. One architect told us they were spending more time managing their mesh than building features.

All Is Calm, All Is Compliant: Staying Audit-Ready Through the Year-End Rush

As the year winds down, I find that most cybersecurity and compliance teams are focused on closing projects, hitting targets, and maybe even planning a well-earned break. But regulators? They don’t take holidays. FCA, PRA, GDPR – they remain vigilant, and so should you. For IT leaders, this season often feels like walking a tightrope: balancing operational demands with the relentless need for compliance.

From FinOps for AI to AI-Native FinOps

One year ago, at AWS re:Invent, we launched CloudZero Advisor, a free, standalone AI assistant that enables anyone to ask questions about cloud spend in plain language. It was the first experiment of its kind in FinOps, a chance to see what people really wanted to know when cost data finally became conversational. Over the past year, Advisor has become a learning engine.

Information as a Strategic Weapon: Building the Architecture of Advantage

Information dominance has become key to battlefield success. The evolution from Network-Centric Warfare to Multi-Domain Operations (MDO) and JADC2 is all about connecting drones, sensors, weapon-systems and decision-makers, across land, air, sea, cyber, and space… in real time. Read about the journey, principles and building blocks, and how Ribbon Communications’ solutions are in the middle of it.

Stop tool sprawl - Welcome to Terraform/OpenTofu support

Provisioning cloud resources shouldn’t require a second stack of tools. With Qovery’s new Terraform and OpenTofu support, you can now define and deploy your infrastructure right alongside your applications. Declaratively, securely, and in one place. No external runners. No glue code. No tool sprawl.