Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Containers, Kubernetes, Docker and related technologies.

10 Enterprise AI Infrastructure Voices Worth Following

Enterprise AI has crossed an inflection point. The model problem is largely covered. What remains unsolved is the operational impact: how to run AI inference and agentic processes continuously, reliably, and at a cost that doesn’t cancel out the value. Most enterprises are discovering this the hard way. GPU utilization dashboards show 80%. Actual compute efficiency is half that. Token demand is compounding at 200-500% annually as agents multiply every action into dozens of model calls.

21 AI concepts every beginner should know before their first interview

If you’re prepping for your first AI or MLOps interview, the hardest part usually isn’t always the hands-on element. For me, it’s the vocabulary. Interviewers sometimes lob single-word concepts at you (“what’s quantization?”) and watch how far you can carry the thread. The questions sound clear-cut, but each one is really a doorway into a bigger topic, and the interviewer is judging how cleanly you walk through it.

Blackwell sold out in weeks. Here's what Rubin demand will look like.

"Blackwell sales are off the charts, and cloud GPUs are sold out. Compute demand keeps accelerating and compounding across training and inference, each growing exponentially. We've entered the virtuous cycle of AI." Jensen Huang, CEO, NVIDIA When NVIDIA's CEO makes that statement in a quarterly earnings release, it is not marketing language.

How to deploy Canonical Managed Kubeflow on Microsoft Azure?

Learn how to deploy Canonical Managed Kubeflow on Microsoft Azure step by step. Canonical's Managed Kubeflow on Azure gives enterprise and startup AI teams a fully operational, open source MLOps platform in under an hour. It is managed 24/7 by Canonical's engineers. This means you can focus entirely on building models rather than running infrastructure.

What's new in Calico: Spring 2026 Release

Kubernetes has come a long way since its debut in 2014. It’s gone from running a couple of containerized microservices to orchestrating fleets of production workloads spanning everything from AI agents to full scale VMs running in pods. As Kubernetes adoption grows, and its use cases stretch to cover more ground, managing its increasingly complex networking and security landscape demands operational maturity and a platform that supports it.

Introducing Cycle's European Control Plane: Strict data sovereignty, lower latencies, and more

We're thrilled to announce that Cycle's European Control Plane is now live! While a few organizations have been utilizing it over the past month, we're eager to officially open access to all teams. Before diving deeper into the "why," let's clarify what a Cycle Control Plane actually is. If you visit our status page, you'll see a list of the core services powering Cycle. These services include everything from our APIs to our 'factory' build systems.

Understanding GPU cloud instance types: How to read a spec sheet for real-world ML performance

A GPU spec sheet is a confidence trick. It looks like an objective document - numbers, units, comparable rows - but most of the numbers on it don't map cleanly to the performance a real workload will see. Teams that pick GPUs by reading the headline figures usually find out the gap between spec and reality somewhere around the first production run. This is a working guide to reading GPU cloud instance specifications against actual ML workloads. The goal isn't to recommend a card.

The Lovable Experience. Enterprise Governance. Your Infrastructure. We Built It.

Introducing the AI Builder Portal - the governed alternative to Lovable and Bolt.new for enterprise. Same one-click builder experience, running on your Kubernetes cluster, under your governance. Romaric founded Qovery to make Kubernetes accessible to every engineering team. He writes about platform strategy, developer experience, and the future of cloud infrastructure.