Komodor

https://komodor.com/

Tel Aviv, Israel

2020

5 Optimization Blockers You Didn't Know Were Inflating Your Cloud Bill

Jul 23, 2026 | By Komodor

Most cloud-native cost tools are built to find and address waste reactively. Underutilized nodes, oversized requests, and idle workloads are revealed in the utilization data, the fixes are well documented, and the initial savings these tools drive are very real. But what we’ve seen consistently across clusters is a different category of blocker, one that quietly prevents consolidation and strands capacity your autoscaler can never reach. They don’t surface in dashboards as obvious waste.

Read Post

The Investigator That Remembers: Inside Klaudia Memory

Jul 16, 2026 | By Nir Adler

There is a particular kind of incident every SRE team is familiar with. A common component of your stack, say your Redis database, starts misbehaving. Someone spends two hours tracing it back to a connection pool exhausted by a misconfigured client, the fix goes in, and everyone moves on, for today. The following Tuesday it happens again, and whoever is on call investigates it from scratch, because the person who solved it last week is asleep, on vacation, or working somewhere else now.

Read Post

Building AI SRE Agents, Part 1: Start Local, Break Things, Learn Fast

Jul 9, 2026 | By Nir Adler

The first stage of AI SRE maturity is a laptop, a throwaway cluster, and zero production access. Here’s how to set it up, and what to watch for. AI SRE (Site Reliability Engineering) agents are AI-powered systems that automate the most time-consuming parts of incident response: triaging alerts, correlating logs and metrics, generating root-cause hypotheses, and proposing remediation steps.

Read Post

Klaudia Under the Hood: How We Built an AI SRE That Actually Earns Trust

Jun 18, 2026 | By Asaf Savich

In reliability engineering, being ‘mostly right’ is a liability. An AI SRE that sometimes misses the root cause or gives a confident, wrong answer at 2:17 AM has no place in an enterprise cloud environment. In this context, silence is better than noise. That’s the bar Klaudia is built to clear: genuine reliability that you can trust in production. The kind of reliability that earns a place alongside your best engineers. Getting there requires more than just a capable model.

Read Post

Komodor Unveils Proactive Optimization to Unlock Stranded Cluster Capacity

Jun 10, 2026 | By Komodor

AI SRE Platform’s new capabilities resolve structural blockers and optimize workload placement, eliminating cloud waste to drive up to 80% in total cost savings.

Read Post

The Two-Sided Scheduling Problem: Reaching the Next Layer of Cloud Savings

Jun 10, 2026 | By Adi Fayer

You’ve deployed Karpenter or Cluster Autoscaler and tightened your resource requests, but while you saw an initial dip in your cloud bill, your savings have flatlined. Organizations that thought they had the fundamentals of cloud cost under control are now seeing stagnation. The problem isn’t that they need another FinOps tool or better visibility. The problem is that the current state of enterprise cloud cost optimization strategy is fundamentally reactive.

Read Post

Solved: fatal: Not a git repository (or any of the parent directories): .git

May 15, 2026 | By Itay Kimia

The fatal: not a git repository (or any of the parent directories): .git error means Git cannot find a.git directory in your current folder or any parent folder. In most cases, you are either in the wrong directory, the project was never initialized with Git, or the.git folder is missing or corrupted.

Read Post

The FinOps Competitive Landscape in 2026 - When Cost Optimization Meets Reliability

May 14, 2026 | By Ilan Adler

The dashboard says you can save 30%. The SRE team won’t sign off. You’ve probably been in this meeting. Finance has a number. The platform team has a scar. Somewhere between them sits a senior manager, maybe you, being asked to choose a cost optimization tool that one side will champion and the other side will quietly refuse to deploy in production. The standoff isn’t about price. It’s about trust.

Read Post

Rightsizing Nightmares: When Your Cloud Cost Tool Degrades Performance

Apr 30, 2026 | By Ilan Adler

This is what production teams see happening. A vertical pod autoscaler recommendation gets applied automatically. Resource requests come down a notch across a namespace. The cost dashboard registers a small cost savings win. A few minutes later, health checks start failing. Pods enter crash loops.

Read Post

All You Need to Know About CrashLoopBackOff Error

Apr 27, 2026 | By Komodor

Kubernetes is an open-source container orchestration engine that is used to automate containerized application deployment, scaling, and administration. It is an open-source management platform that can be used to manage containerized workloads and services, as well as declarative configuration and automation. Kubernetes is a framework for running distributed systems in a resilient manner. It handles scaling and failover for your application and provides deployment patterns and other features.

Read Post

#060 - Beyond ELK: Elastic's 10-Year Evolution, Open-Source Licensing, and the AI Frontier with P...

Jun 11, 2026 | By Komodor

In this episode of the Kubernetes for Humans podcast, Philipp shares his incredible 10-year journey at Elastic, witnessing the company's massive growth from 300 to 4,000 employees. Discover the fascinating origin story of how Elastic evolved from a simple recipe search project into a global powerhouse for observability, security, and vector databases.

View Video

#059 - From Early K8s to the Edge: Shifting Compute Left with Dave Aronchick

May 19, 2026 | By Komodor

In this episode of the Kubernetes for Humans podcast, tech veteran Dave Aronchick shares his incredible journey from leading the Kubernetes and GKE projects at Google to co-founding Kubeflow and his current venture, Expanso.

View Video

#058 - The Future of AI and Platform Engineering with Blake Sherwood (Smarsh)

May 13, 2026 | By Komodor

In this episode, special guest Blake Sherwood joins the show to discuss his unique career trajectory from tourism and coal mining to leading massive-scale Kubernetes migrations. Blake shares insights from his experience managing petabytes of data in high-compliance environments, delving into the practical realities of integrating AI into enterprise workflows and observability systems.

View Video

#057 - From Pagers to Pair Programming: Navigating Massive Scale and AI with Stefana Muller (Sale...

May 4, 2026 | By Komodor

In this episode of "Kubernetes for Humans," Stefana Muller, VP of Infrastructure & Operations at Salesforce, shares her fascinating journey from technical support to navigating the massive scale of the Own Backup acquisition. Stefana dives into the immense multi-cloud Kubernetes challenges of scaling from 18,000 to over 52,000 clusters, standardizing environments across AWS and Azure, and leveling up security to meet stringent Salesforce standards.

View Video

#056 - Cloud Contradictions and Cautionary Tales with Corey Quinn (The Duckbill Group)

Apr 30, 2026 | By Komodor

In this episode of the Kubernetes for Humans podcast, Itiel sits down with the internet's favorite cloud contrarian, Corey Quinn of the Duckbill Group. Corey shares his unconventional career path as a "cautionary tale," explaining why his knack for fixing horrifying AWS bills makes him a terrible employee, and why he absolutely refuses to touch Kubernetes in production.

View Video

[DEMO] Komodor powered by Klaudia: Autonomous AI SRE Platform for Cloud-Native Infrastructure

Apr 9, 2026 | By Komodor

View Video

#055 - From Enterprise Java to Kubernetes and AI-Driven Infrastructure with Dan Hicks (Boomi)

Apr 1, 2026 | By Komodor

Dan breaks down the fundamental similarities and stark differences between application development and platform engineering. He shares the unexpected hurdles he faced during his transition, from complex networking and CoreDNS latency to the harsh realities exposed by chaos testing in cloud environments.

View Video

#054 - From Shiny Objects to FinOps: Taming Cloud Costs in the AI Era with Josh Schlanger (CloudX...

Mar 26, 2026 | By Komodor

In this episode of the Kubernetes for Humans podcast, we are joined by infrastructure and FinOps expert Josh Schlanger. Drawing on over 15 years of experience across Martech, e-commerce, and health tech, Josh shares why solving core business problems should always take priority over chasing new, "shiny object" technologies.

View Video

[Webinar] Conquering the Complexity of Self-Hosted Apps with Agentic AI SRE

Feb 26, 2026 | By Komodor

Most enterprise SaaS products, like Komodor’s Autonomous AI SRE Platform, require installing a remote agent on the customer’s infrastructure, which varies significantly from one organization to another, in terms of architecture, configurations, permissions, processes, and more. This “unmanaged” model creates major blind spots, making daily operations, observability, debugging, and incident response challenging. When failures occur, limited visibility and bespoke systems make root-cause analysis slow, incomplete, or impossible.

View Video

#053 - The Road to Distributed AI and Kubernetes Infrastructure with Matt Butcher (Fermyon) & Ari...

Feb 13, 2026 | By Komodor

They share their professional origins, highlighting how Kubernetes transitioned from a complex tool for experts to a foundational technology for global enterprises.. Part of the conversation focuses on the history of Helm, explaining its growth from a simple hackathon project into a standard package manager. Another part takes on the future of distributed computing, specifically how Akamai is integrating infrastructure as a service to support modern workloads.

View Video

Monthly Archive

Follow Us