Tel Aviv, Israel
2020
  |  By Komodor
Kubernetes is an open-source container orchestration engine that is used to automate containerized application deployment, scaling, and administration. It is an open-source management platform that can be used to manage containerized workloads and services, as well as declarative configuration and automation. Kubernetes is a framework for running distributed systems in a resilient manner. It handles scaling and failover for your application and provides deployment patterns and other features.
  |  By Snir Amsalem
SREs and platform teams are overwhelmed by the effort of manually troubleshooting ever-more complex cloud-native environments. This pain is driving a breakneck adoption of AI SRE solutions that promise to automate core reliability practices, from root cause analysis to capacity planning. For teams with strong engineering talent, creating a DIY AI SRE seems like a straightforward challenge.
  |  By Komodor
Virtual event will explore what it takes to use AI in production SRE, from incident response and observability to platform design, cost control and self-healing operations TEL AVIV and SAN FRANCISCO, April 22, 2026 — Komodor, the autonomous AI SRE company, today announced it will host AI SRE Summit 2026, a free live virtual event on Tuesday, May 12, 2026, bringing together site reliability, platform engineering and cloud-native leaders to discuss how AI is changing production operations, and where i
  |  By Danielle Inbar
Platform Engineering leaders are caught between two competing imperatives. You’re under pressure to flatten cloud spend but your team is still provisioning defensively because nobody wants to be the person who causes a production incident. You try to optimize, but six months later, when someone pulls a report, nothing has changed.
  |  By Asaf Savich
Cluster API (CAPI) is transforming how organizations deploy and manage fleets of Kubernetes clusters by introducing declarative, Kubernetes-style APIs to automate cluster provisioning and lifecycle management. While CAPI excels at creating consistent and repeatable cluster deployments across different infrastructure providers, operating it at a massive scale introduces unique day-to-day challenges.
  |  By Itiel Shwartz
Once upon a time, a monolith running on a handful of servers meant that incident management, even at 2:17 AM, was something a single generalist could handle. One person with enough context across the stack could reasonably diagnose whether the database was choking, a config had changed, or a server was running hot. They’d fix it and go back to sleep.
  |  By Komodor
Out-of-the-box and bring-your-own AI agents that encode operational knowledge boost troubleshooting speed and accuracy across cloud native infrastructure TEL AVIV and SAN FRANCISCO, March 18, 2026 — Komodor, the autonomous AI SRE company for cloud-native infrastructure, today announced a new extensibility framework that transforms its Klaudia AI technology into a universal multi-agent platform for troubleshooting and optimizing performance of complex cloud native infrastructures and applications.
  |  By Ilan Adler
A FinOps analyst walks into a Monday morning meeting with a detailed spreadsheet showing $2.3M in potential Kubernetes cost savings. The recommendations look straightforward: reduce memory limits by 40%, scale down replicas during off-peak hours, consolidate workloads onto fewer nodes. The numbers are compelling, the methodology is sound, and the savings would make a material impact on quarterly cloud spend. The SRE team immediately objects.
  |  By Itiel Shwartz
Kubernetes troubleshooting traditionally requires deep platform expertise. Understanding pod lifecycle, decoding error messages, correlating events across resources, and identifying root cause all demand experience that takes years to build. This expertise gap creates a bottleneck where only senior engineers can handle production issues, limiting how quickly teams can resolve incidents.
  |  By Ilan Adler
This is part two of a series of the implications of AI generated code becoming mainstream. We recently wrote about how AI-generated code is overwhelming SRE teams with production complexity they can’t manage. Turns out that’s only half the problem. The other half shows up on the cloud bill. A prospect reached out to us last month. They’d been using Cursor and Claude Code for six months, shipping features at unprecedented velocity. Product was thrilled.
  |  By Komodor
In this episode of the Kubernetes for Humans podcast, we are joined by infrastructure and FinOps expert Josh Schlanger. Drawing on over 15 years of experience across Martech, e-commerce, and health tech, Josh shares why solving core business problems should always take priority over chasing new, "shiny object" technologies.
  |  By Komodor
Most enterprise SaaS products, like Komodor’s Autonomous AI SRE Platform, require installing a remote agent on the customer’s infrastructure, which varies significantly from one organization to another, in terms of architecture, configurations, permissions, processes, and more. This “unmanaged” model creates major blind spots, making daily operations, observability, debugging, and incident response challenging. When failures occur, limited visibility and bespoke systems make root-cause analysis slow, incomplete, or impossible.
  |  By Komodor
They share their professional origins, highlighting how Kubernetes transitioned from a complex tool for experts to a foundational technology for global enterprises.. Part of the conversation focuses on the history of Helm, explaining its growth from a simple hackathon project into a standard package manager. Another part takes on the future of distributed computing, specifically how Akamai is integrating infrastructure as a service to support modern workloads.
  |  By Komodor
In this episode, Itiel joins forces with Shemer, Director of Platform Solutions at the gaming giant Playtika, and Scott Rosenberg, Lead Architect at TeraSky, to discuss the realities of platform engineering at a massive scale. The trio dissects Playtika’s multi-year journey from a legacy, homegrown Kubespray infrastructure to a modern, holistic platform built on Spectro Cloud, all while running strictly on-premise to support 25+ games and high-volume traffic.
  |  By Komodor
From the early days of "neural nets" and WebSphere to the modern complexities of Kubernetes, Hayato Shimizu has seen the evolution of infrastructure firsthand. In this episode of Kubernetes for Humans, the co-founder of Digitalis joins the show to discuss the harsh realities of enterprise platform engineering and his personal journey from corporate employee to consultancy owner.
  |  By Komodor
Watch as Itiel Shwartz, Komodor CTO and Co-Founder as he shares hard-won lessons from developing an AI agent that processes millions of K8s events daily to deliver autonomous troubleshooting that reached 95%+ accuracy in benchmarking. This webinar covers: Building production ready systems that maintain reliability when 90% of your data is noise. How Komodor developed an AI SRE agent that processes millions of K8s events daily to deliver autonomous troubleshooting that reached 95%+ accuracy in benchmarking.
  |  By Komodor
Join Hasith Kalpage, Director of Platform Engineering , and Arthur Drozdov, Agentic AI Engineer, as they share how Cisco is using Komodor’s Klaudia Agentic AI to evolve its platform strategy, to unlock smoother developer experience, slash MTTR, and reduce bottlenecks across the enterprise. – Including a live demo of the CAIPE platform!
  |  By Komodor
Komodor is the leading Autonomous AI SRE Platform for cloud native infrastructure and operations. Powered by Klaudia Agentic AI, Komodor automatically visualizes, troubleshoots, and optimizes Kubernetes-based platforms at scale.
  |  By Komodor
In this episode Itiel hosts Veeam experts Julia and Michael, to share their distinct paths into cloud-native technology. Julia discusses her transition from a background in law and marketing to becoming a CNCF ambassador and AWS container hero. Michael, a veteran who has been with Veeam for over 10 years, details his traditional CIS admin background (virtualization, storage) and the evolution of this role into platform engineering.
  |  By Komodor
In this hands-on workshop, we covered how to build your own MCP server from scratch and connect it to AI tools like Cursor IDE or Claude Desktop. The first half is a live coding session you can follow along with to set up an MCP server for Kubernetes troubleshooting. In the second half, we take you behind the scenes at Komodor to show how we built our MCP Server MVP: a powerful bridge between AI assistants and Kubernetes infrastructure. This is just part of the 'magic' that helps the Klaudia agentic AI technology power Komodor's AI SRE Platform.
  |  By Komodor
The Handbook for Kubernetes Errors is an essential guide to understanding and resolving all of the most common Kubernetes issues, including: and many more.

Komodor tracks changes across your entire K8s stack analyzes their ripple effect and provides you with the context you need to troubleshoot efficiently and independently.

Today’s microservices systems are complex, distributed and they are constantly changing. Keeping track of so many moving parts in so many places often seems nearly impossible! Komodor is the missing piece in your DevOps toolchain – offering one unified platform from which you can gain a deep understanding of all of your system events, changes and their effect.

Turning troubleshooting chaos into clarity:

  • Empower on-call teams: Make the knowledge and expertise that has traditionally been held by only a few, clear and visible to Dev and SRE teams.
  • Understand your Kubernetes: Gain the K8s visibility you are lacking. See your deployments on a timeline with the relevant information: what changed, what code was pushed and by whom.
  • Track your system end-to-end: View data from your Git, config, infra, alerting and other tools, in one centralized and easy-to-understand display.
  • Uncovering the context: Troubleshoot your microservices based on the most relevant context, connections and dependencies.

Keep building. We’ve got your back.