Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Faster, compliant delivery on regulated cloud with Upsun and IBM Cloud for Financial Services

We are continually enhancing our offering to support enterprises looking to modernize without the pain of modernization. We partnered with IBM to bring our highly flexible cloud application platform to the IBM Cloud Marketplace to give financial service organizations a cloud option that meets both workload and organizational requirements.

Build vs Buy IaC: Choosing the Right IaCM Strategy | Harness Blog

Have you ever watched a “temporary” Infrastructure as Code script quietly become mission-critical, undocumented, and owned by someone who left the company two years ago? We can all related to a similar scenario, if not infrastructure-specific, and this is usually the moment teams realise the build vs buy IaC decision was made by accident, not design.

How to Scale GitOps Without Hitting the Argo Ceiling | Harness Blog

The Argo ceiling is a predictable scaling challenge, not a failure of Argo CD or GitOps. As clusters and teams grow, visibility, governance, and orchestration fragment without a control plane. Script-heavy workflows and manual processes slow delivery and increase risk at scale. A GitOps control plane enables unified visibility, structured workflows, automated guardrails, and secure secret management. GitOps has become the default model for deploying applications on Kubernetes.

Kubernetes Cost Traps: Fixing What Your Scheduler Won't | Harness Blog

Kubernetes cost overruns usually come from small, invisible scheduling decisions—not the platform itself. Over-provisioned requests, poor bin packing, and fragmented node pools quietly waste cloud spend. Cost-aware scheduling, right-sizing, and smarter node selection can deliver major savings without hurting performance. Treat cost as a first-class metric with visibility into why scaling decisions happen—not just when.

Gemini Cost Per API Call in 2026: What You'll Actually Pay (And How to Control It)

On paper, Gemini pricing looks straightforward. You pay per token. Input tokens cost one amount, output tokens cost another, and different models come with different rates. But once Gemini is wired into a production SaaS product, that simplicity disappears. Fast. That’s because token usage compounds across context, retrieval, and output — not across requests. The same “API call” can cost pennies in one feature and dollars in another.

Webinar (Jan 15 2026): Take Back Control of Your Infrastructure (feat. nvisia)

Learn how leading teams are reducing complexity, controlling costs, and building resilient environments with modern private cloud patterns.. What we covered: If you’re evaluating private cloud, hybrid infrastructure, or looking to take back control of your infrastructure in 2026, this session provides a clear, actionable starting point. Reach out to our team to learn more today!

AI SRE in Practice: Resolving Node Termination Events at Scale

When a node terminates unexpectedly in a Kubernetes cluster, the immediate symptoms are obvious. Workloads restart elsewhere, services experience partial outages, and alerts fire across multiple systems. The harder question is why it happened and how to prevent it from recurring. This scenario walks through a node termination event where the entire node pool was affected, requiring investigation across infrastructure layers to identify root cause and implement lasting remediation.

AI Hosting: The Colocation vs. Cloud Dilemma for Your Next Project

Organisations running AI workloads, like banks training fraud detection models, hospitals testing diagnostic tools, or manufacturers using predictive analytics, all face the same problem: hosting them is costly and resource-intensive. They require dedicated GPUs running non-stop, vast amounts of data moving in and out, and far more power and cooling than a typical IT system.