Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Streamline Your Development Workflow with Bunnyshell: Achieve Faster Time-to-Market

In today’s fast-paced software development landscape, maintaining consistent and reliable environments across all stages—whether it’s development, testing, or production—is crucial. The "works on my machine" problem is all too familiar, leading to inefficiencies and delays that can derail your projects. Enter Bunnyshell, a game-changer in the world of environment management that can transform your development workflow and drastically accelerate your journey from code to production.

Reliability-Driven Fleet Management with Komodor

Maintaining a few K8s clusters is hard enough. Maintaining 1000+ clusters is virtually impossible without embracing new tooling and paradigm shifts. Join us for an insightful LIVE workshop exploring the possibilities of Kubernetes Fleet Management with Komodor, lead by Itiel Shwartz* In this session, we will dive into the challenges of multi-cluster management and how Komodor's comprehensive platform simplifies operations. Discover how to gain real-time visibility into your clusters, automate routine tasks, and troubleshoot issues across your entire fleet efficiently.

Back to the Basics: The Foundational Role of DDI in Any Network

In the ever-evolving landscape of networking, there are a plethora of three-letter acronyms that make up the wonderful alphabet soup that is a part of every engineer’s vocabulary. Whether it’s TCP, UDP, SSH, or one of the many other dozens, one acronym is commonly left out of the discussion: DDI. These seemingly simple letters are often overlooked or rarely thought of, but they are a crucial foundation for managing a stable, secure, and efficient network.

Reward engineers who fix problems before they cause outages

Are you recognizing the good work engineers do to prevent outages? "The people that are out there doing good work to prevent fires from ever occurring, we're not often recognizing them. We're not often rewarding them. And once things go wrong, someone comes in and fixes it. That's great. That's needed. But we're rewarding that behavior. And so it becomes a bit of people are motivated by what behavior you reward.

Migrating from SVN to Git: Step-by-Step Guide

Article updated June 2024 Is your current Subversion (SVN) version control system not meeting the needs of your development team? Perhaps you’ve heard of Git, but you’re so entrenched in SVN that converting to a new version control system seems like a daunting task. Fear not! No task is insurmountable when you have the power of the legendary GitKraken Desktop on your side.

The Definitive Guide to Kubernetes Cluster Upgrades

Kubernetes continues to play a pivotal role in orchestrating containerized applications with its cloud-native capabilities. Of course, capabilities like flexibility and scalability mean organizations must be extra vigilant, especially when it comes to maintaining the health and efficiency of Kubernetes clusters.

Running ML/LLM models on Kubernetes Across Major Cloud Providers with Abhishek Choudhary

Abhishek, co-founder and CTO of @truefoundry, explores the complexities of building a machine learning platform on Kubernetes. Discover solutions to challenges like handling diverse hardware, managing large Docker images, and optimizing costs. Learn how True Foundry uses tools like Argo CD, Keda, and Istio to create efficient abstractions for data scientists and streamline ML operations.

Managed Apps on Public Cloud: Why Operations Matter, Part I

You might be tempted to think that running an app on a public cloud means you don’t need to maintain it. While that would be wonderful, it would require help from the public cloud providers and app developers themselves, and possibly a range of mythological creatures with magic powers. This is because any app, regardless of the infrastructure on which it runs or its output, requires maintenance in order to yield accurate and reliable outputs.