Operations | Monitoring | ITSM | DevOps | Cloud

Kubernetes for AI Workloads

Kubernetes has been facilitating container orchestration for around a decade for both stateful and stateless application workloads. With the recent rise of AI and the advent of tools like Kubeflow and Argo Workflows, Kubernetes is also becoming a first-class citizen when it comes to running AI workloads. When you are training a model on K8s, you may be tweaking many parameters and have to test each of them one by one.

#036 - Beyond Kubernetes: A Radical Vision for the Future of Infrastructure with Adam Jacob (Syst...

Adam Jacob, CEO of System Initiative and original author of Chef, discusses the evolution of infrastructure automation and his career-long passion for infrastructure. Jacob reflects on the history and context of Chef, its emergence alongside EC2, and its role in configuration management. He shares insights into the competitive landscape of configuration management tools like Chef, Puppet, and Ansible, and touches upon the transition of Chef to Progress.

The AI Model Showdown - LLaMA 3.3-70B vs. Claude 3.5 Sonnet v2 vs. DeepSeek-R1/V3

Following all the hype and bluster with DeepSeek’s arrival in the AI landscape––and its ability to crash the poster child of AI’s share value overnight (Nvidia), we wanted to conduct a rigorous evaluation at Komodor. We tested DeepSeek’s models head-to-head against industry leaders in solving real-world Kubernetes challenges.

#035 - Beyond Kubernetes: A Veteran of the Container Wars on the Past, Present, and Future of Clo...

This episode of "Kubernetes for Humans" features Dan Ciruli, a Senior Director of Product Management at Nutanix, who shares his journey in tech and his perspective on the evolution of cloud-native technologies. Ciruli discusses his early career as an engineer and his transition to product management, noting that the role was not well-defined in the 1990s. He recounts his experiences with startups, Google, and D2IQ (formerly Mesosphere), highlighting the rise of Docker and projects like Mesos.

Managing External-DNS & cert-manager with Komodor

Recently we’ve explored the evolving role of Kubernetes as a full ecosystem, rather than just a platform, diving into the power and complexity of add-ons. These tools, as highlighted previously, are key to augmenting Kubernetes core capabilities, and adding-on (as their name implies) essential capabilities not supported directly by Kubernetes itself.

Simplifying DNS Automation with ExternalDNS and cert-manager

Managing DNS records in Kubernetes at scale is complex, especially as clusters grow and the number of applications increases. Enter ExternalDNS—a tool designed to automate DNS record synchronization with Kubernetes resources, providing the agility and scalability needed for modern application environments.

#034 - Infrastructure Automation & the Future of Ops with Cory O'Daniel (Massdriver)

This podcast features Cory O'Daniel, CEO of Massdriver, an infrastructure automation platform. O'Daniel discusses his extensive background in software engineering and cloud operations, highlighting his expertise in Erlang and Elixir programming languages and their applications within Kubernetes. He explains Massdriver's role in simplifying infrastructure management for both developers and operations engineers by visually representing infrastructure-as-code.

Mastering Multi-Cluster Kubernetes Certificate Management with cert-manager

Managing TLS certificates in Kubernetes is no small feat, and the complexity only grows when you’re dealing with multiple clusters. Ensuring secure communication, automating certificate renewals, and integrating with external Certificate Authorities (CAs) are just a few of the challenges Kubernetes administrators, DevOps engineers, and security professionals face.