Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Containers, Kubernetes, Docker and related technologies.

Understanding Docker monitoring: A comprehensive list of key Docker metrics

In today’s fast-paced development landscape, containerization has become a cornerstone for deploying scalable and efficient applications. Docker, as one of the most popular container platforms, offers a robust environment for building and running containers. However, with great power comes the need for greater scrutiny, i.e., Docker monitoring or observability. Understanding Docker metrics is key to maintaining optimal performance and ensuring your containerized applications run smoothly.

SUSE and RKE2 are introducing KubeSleep: Smart Kubernetes Scaling Based on Developer Inactivity

We’re excited to announce Kubesleep, a smart Kubernetes operator developed by SUSE that optimizes cluster efficiency and significantly reduces infrastructure costs. Kubesleep automatically scales workloads based on actual developer activity, intelligently detecting periods of inactivity and scaling down resources to save energy and expenses. Best of all, your clusters smoothly scale back up before developers even notice.

Ending the IngressNightmare: How SUSE Secures Your Kubernetes Clusters from External and Internal Threats

In March 2025, Wiz researchers disclosed a set of critical vulnerabilities in the popular ingress-nginx controller for Kubernetes. Collectively referred to as IngressNightmare, these issues (CVE-2025-1097, CVE-2025-1098, CVE-2025-24513, CVE-2025-24514, and CVE-2025-1974) allow unauthenticated attackers to exploit the Ingress admission controller, potentially achieving remote code execution or escalating privileges in the cluster.

Back to the Metal

Bare metal is BACK! For years virtualization has absolutely dominated the cloud market. The market for virtualization is still 10x larger than bare metal ($8B USD vs$100B USD). But now consumers are demanding MORE for their workloads. … and the signal from the data suggest that this trend isn't going away anytime soon. If we look a bit deeper, we might see another story enabling the avalanche of (re) adoption in bare metal.

Kubernetes Monitoring: One view for observing all your storage volumes

If you want to observe your entire Kubernetes environment, you need visibility into all of your resources, including storage volumes. But monitoring Kubernetes storage hasn’t always been easy, especially if you wanted to see how it related to other parts of your infrastructure.

9 Best Container Monitoring Tools You Should Know in 2025

In a world where containers power everything from startup MVPs to enterprise applications, keeping tabs on your containerized environment isn't just good practice—it's survival. Container environments are notoriously dynamic and ephemeral, creating unique monitoring challenges that traditional tools simply can't handle. We've sorted through the noise to bring you the nine tools that deliver.

Optimizing Kubernetes node resources: How to avoid exhaustion and improve performance

Resource exhaustion at a node remains a critical issue. However, the automation of deployment and management of containerized applications is executed relatively efficiently in Kubernetes. When a node is low on resources—as in CPU, memory, or storage—a workload may suffer from failures, degraded performance, and eviction.

Scale Anything: How Komodor Enhances Autoscaler Capabilities

Kubernetes autoscalers like Cluster Autoscaler (CAS) and Karpenter have evolved significantly to manage the sprawling Kubernetes ecosystem, which has grown far beyond a simple container orchestration platform to include a vast array of add-ons, operators, CRDs, and third-party integrations. These autoscalers play a crucial role in ensuring K8s workloads get the resources they need, precisely when they need them, without creating excess and waste.

Remediate Kubernetes incidents faster using private actions in your apps and workflows

The Datadog Action Catalog provides more than 1,400 actions to help you accelerate remediation across your infrastructure directly within Datadog. With actions, you can use Workflow Automation to configure workflows that automatically address issues as they happen and build custom apps in App Builder that empower anyone in your organization to act when incidents occur.