Operations | Monitoring | ITSM | DevOps | Cloud

Sponsored Post

Getting to the Zero Engineers Code Development Moment

The software world is inching rapidly toward an era once thought impossible: a time when no engineers are needed to write code. Not because software will disappear, but because the tools writing the code will be intelligent, autonomous, and capable of reasoning, generating, and deploying entirely on their own. We're not there yet-but we're getting very close.

What Is Azure SQL?

Modern cloud solutions mean much more than just data storage. Cloud technologies cover virtual services, including analytics, databases, networking, servers, and storage via the internet. Such a giant as Microsoft is among the most prominent cloud providers with its Azure platform. “Platform as a Service,” or PaaS, is a popular solution for database specialists. It is a powerful database engine that allows you to perform most database management routines.

The $465M Bug: What Every Dev Needs to Know About Code Health

Is your codebase quietly killing developer productivity? In this talk from GitKon, Jai Predeesh (co-founder of DeepSource) breaks down why code health is more than a buzzword... it’s a critical lever for preventing failures, improving dev morale, and scaling without firefighting. From Knight Capital’s $465M bug to subtle security flaws in AI-generated code, this session shows how automated static analysis can catch the issues that escape human eyes and PR reviews.

A Guide To Azure Database Pricing (And Reducing Costs)

You spun up Azure SQL for your app backend, added Cosmos DB for global performance, and let your devs explore PostgreSQL freely. Everything worked — until the invoice hit. Your engineers need high availability and performance. Your CFO wants predictability. And you’re stuck trying to untangle what, exactly, is driving your Azure bill. You’re not alone. Between service types, pricing tiers, and throughput models, Azure database pricing can surprise even experienced teams.

Enterprise Drupal: why hosting choice impacts your success

Deploying a large-scale Drupal application involves numerous decisions, and one of the most significant is where to host it. Should you keep everything on-premises, use a managed hosting provider, or opt for a modern PaaS like Upsun? Each approach has trade-offs. In this article, we’ll dive into the real-world technical pitfalls of on-premises and managed hosting, and then explore how Upsun addresses those challenges in an enterprise Drupal context.

Get Ready for the Next Level of Gaming!

Recorded at Civo Navigate Austin 2025, join industry experts from Perforce, Gametree, and Streamfog as they share their insights on the current state of technology in gaming, cloud-native technologies, and the future of tech in gaming. This panel discussion explores the latest trends and innovations in game development, AI-powered gaming, and emerging business models, revealing how the gaming industry is leveraging cloud-native technologies, AI, and other cutting-edge tech to drive innovation and growth.

Kubernetes Cost Optimization Done Right

Kubernetes was never just about cost savings. It was built to be a robust, scalable, and efficient platform for orchestrating containerized applications. And it was meant to abstract infrastructure away so developers could move quickly and go about their business of developing. But as Kubernetes adoption scaled, so did cloud bills. FinOps tools emerged to rein in spending, but most only scratch the surface.

NAT Management Made Easy: See Every Translation

Network Address Translation (NAT) is foundational in today’s enterprise and cloud networks, yet NAT documentation often remains an afterthought, managed in spreadsheets or skipped entirely. This creates unnecessary complexity, security blind spots, and wasted IP resources. At LightMesh, we believe IP address management (IPAM) should include intuitive tools to track NAT configurations just as easily as subnets and IP assignments.

Spike vs. PagerDuty: Which On-Call Management Tool Is Better in 2025

If you’re stuck between choosing Spike vs. PagerDuty for your on-call management, you’re at the right place. I wrote this blog post to end your confusion and help you make a better choice. I’ve presented a comparative analysis for these two tools across 4 key criteria (keep reading to find what they are). For each criterion, there’s either a winner or a tie. When it’s a tie, each tool gets one point. If there’s a winner, that tool gets two points.

Prometheus Gauges vs Counters: What to Use and When

Choosing the wrong metric type in Prometheus can lead to inaccurate dashboards, false positives in alerting, and missed indicators of system failure. Gauge metrics are intended for tracking values that can go up and down, such as memory usage, queue depth, or the number of active connections. Unlike counters, which only increment (or reset on restart), gauges reflect the current state of a resource at scrape time.

A Quick Guide To Kubernetes Observability

Many companies are rapidly adopting cloud-native computing services, like containers, microservices, and serverless computing. Unlike monolithic applications, these technologies rely on distributed architectures. Whether you are running them in the cloud, on-premises, or both, distributed systems consist of thousands or millions of processes and components. The challenge now is to make these complex systems’ inner workings visible, controllable, and improvable.

Event Intelligence Solutions: The Essential Tools Every ITOps Manager Needs - and How Interlink Software Delivers

david.arrowsmith • June 27, 2025 IT Operations (ITOps) managers need to ensure always-on availability across a more complex and hybrid ecosystem than ever before. Event storms, patchwork toolchains and slow root cause analysis (RCA) impede responsiveness and undermine the high digital performance customers demand. The Event Intelligence and Service Observability Platform from Interlink Software addresses this.

How Puppet is Redefining Infrastructure Management with AI, Powered by Perforce Intelligence

AI has emerged as a defining force in modern technology, spearheading transformation across industries. Yet, despite its promise to revolutionize workflows and unlock unprecedented efficiency, most DevOps organizations face significant hurdles in adopting AI safely and effectively. Concerns about complexity, scalability, and governance hold many decision makers back.

How we're shipping faster with Claude Code and Git Worktrees

Four months ago, Claude Code was announced and we were requesting invites to its "Research Preview." Now? We've gone from no Claude Code to simultaneously running four or five Claude agents, each working on different features in parallel. It sounds chaotic, but it's been a natural progression as we've learned to trust AI more and as the tools have dramatically improved.

CPU monitoring for network admins: Why it matters more than ever

In your role as a network administrator, maintaining smooth, uninterrupted system performance isn’t just a one-time task; it’s your daily mission. Whether you're managing hundreds of endpoints, virtual machines, or hybrid cloud environments, CPU monitoring is one of the most critical tools in your toolkit. Without it, diagnosing performance slowdowns, service lags, or outages becomes reactive guesswork.

13: Effective Resource Optimization and Kubernetes Insights with Daniele Polencic

Kubernetes, container resources, request and limits, sizing, the impact of getting things wrong, CPU limits, JVMs, HPA and VPA, does Karpenter fix the request and limit problem? We’ve got a great episode for you today! Thanks for joining us on Densify Talks! We welcome Daniele Polencic, one of the lead instructors at LearnK8s, which specializes in containers and Kubernetes technologies.

The Artifact Management Market Is Up For Grabs

The enterprise artifact management market - which has belonged for a while to JFrog and Sonatype - is now truly up for grabs. Cloudsmith was built on the core principle that cloud-native architecture matters. So does simplicity in design and workflow. Partnerships matter, too. We’ve built a comprehensive platform that controls and secures every artifact as it’s built, scanned, signed, stored, and shipped across the software supply chain.

Hyperview DCIM 5.0 Software Release: AI Assistant

Hyperview 5.0 introduces a powerful new AI Assistant (Beta) to help users navigate the platform, run reports, and complete tasks faster. The release also features a redesigned search experience with improved speed, shortcuts, and a docking option for multitasking. Additional enhancements include streamlined asset editing, smarter alerting rules, expanded auto-discovery support, and new API endpoints — all aimed at improving efficiency, accuracy, and user experience across the board.

How AI + Automation Are Paving the Way for Autonomous Networks

Network management teams are drowning in alerts, tickets, and manual escalations—slowing MTTR, driving up costs, and jeopardizing service quality. But the rise of AI-driven automation is changing the game. In this webinar replay, experts from Grokstream and Resolve will discuss how AIOps and automation are shaping the future of autonomous networks. Join Josh Kindiger and Ari Stowe to see firsthand how AI can reduce noise, detect root causes, and trigger end-to-end remediation without manual effort.

Self-hosted runners vs cloud CI/CD: A complete decision guide

Your CFO just asked about operational efficiencies across the engineering org. Tooling budgets are under the microscope, and suddenly CI/CD costs are getting attention. Sound familiar? When the pressure’s on to cut software spend, CI/CD often looks like a tempting target. It’s visible, measurable, and seemingly easy to move.

Prometheus and CloudWatch Integration for AWS Metric Collection

The Prometheus CloudWatch exporter pulls AWS CloudWatch metrics into your Prometheus setup, giving you a unified view of your infrastructure alongside application metrics. If you're already running Prometheus and need visibility into AWS services like EC2, RDS, or Lambda, this exporter handles the integration without forcing you to switch monitoring stacks.

Common Drupal deployment pitfalls and how cloud automation fixes them

Deploying a Drupal 10 or Drupal 11 application can feel like walking through a minefield of potential problems. Even a well-built Drupal site can stumble during deployment due to a few common pitfalls. For IT managers and technical project leads, understanding these pitfalls—and how cloud automation can solve them—means fewer late-night emergencies and smoother launches.

Getting Started with Puppet Infra Assistant: A Complete Guide

Managing today's complex enterprise infrastructure presents significant challenges — from siloed data and steep learning curves to time-consuming troubleshooting. As the pace of business accelerates and infrastructure demands grow, these obstacles are increasingly difficult to overcome. That’s why we built Infra Assistant, a new AI capability in Puppet Enterprise Advanced, powered by Perforce Intelligence.

Engineering Excellence in the Age of AI: It's Not Dead, It's Maturing

On a recent episode of The Product Manager podcast, Cortex CEO Anish Dhar joined host Hannah Clark to challenge a growing narrative: that software engineering is obsolete in the age of AI. His take? Engineering isn’t disappearing, it’s maturing. At Cortex, we work with some of the most forward-thinking engineering organizations at companies like Canva and Fanatics.

5 Best On-Call Scheduling Software (Reviewed & Ranked)

Looking for the best on-call scheduling software for your team? Or maybe you’re exploring alternatives to your current tool? Signing up for different on-call tools and testing them all takes weeks. That’s a lot of time you probably don’t have, especially when you need reliable on-call coverage now. That’s why I did the heavy lifting for you. I signed up for and tested the 5 popular on-call scheduling tools in the market: Spike, PagerDuty, Incident.io, Splunk Oncall, and OpsGenie.

The Cost of Waiting: Why Operationalizing AI in IT Can't Be Delayed Any Longer

Most IT leaders already understand that AI is the future of operations, but too many are still treating it like it’s so far off. The irony? Waiting is exactly what’s costing them the most. While businesses obsess over budget cuts, resource constraints, and service quality, one truth remains: delays in adopting AI for IT operations are compounding operational inefficiencies, inflating labor costs, and stalling digital progress. AI isn’t just about innovation; it’s about scale.

How to Convert MS Access Database to MySQL

Microsoft Access is a relational system for managing databases that is used to create small-scale databases for a single user or small teams. MySQL is a robust open-source relational database management system for more extensive data volumes and web applications. With the help of dbForge Studio for MySQL, you can easily migrate data from Microsoft Access to MySQL and preserve data and functional integrity.

Navigating Shopware logs and slow pages in a real world scenario

A Shopware store goes from smooth to sluggish—pages take 10 seconds to load, even longer in some cases. What happened? In this post, we tell the true story of how one overlooked plugin setting nearly collapsed a storefront, and how it was resolved using native tools. If you’re shipping code in Shopware without clear performance observability, this is your wake-up call. Everything was working, until it wasn’t.

Enterprise Drupal: Why hosting all your apps on one platform matters

For many enterprises, Drupal has been the backbone of their web operations for years. It’s a battle-tested CMS that handles complex content needs with elegance. But business needs have evolved. Today, it’s rare for a company to rely only on Drupal. They are spinning up Python APIs, .NET backend services, Node.js apps, Java microservices — expanding their digital ecosystems around Drupal’s core.

SwiftPM, CocoaPods, and the Future of Enterprise Development for Apple Platforms

Swift is the default and preferred language for developing applications within the Apple ecosystem. The Swift Package Manager (SwiftPM) has become the de-facto dependency manager for Swift, enabling developers to share and reuse code effortlessly. While its elegance lies in its simplicity, there’s a common concern about integrating SwiftPM into robust, enterprise-grade development workflows. This is where JFrog Artifactory shines.

GPU Powerhouse: Scaling an AI Cloud in the Heart of Europe

The AI revolution needs more than models - it needs massive infrastructure. And Julien Gauthier is building it. In this episode of Uplink, Julien, CEO of Arkane Cloud, joins host Michael Reid to unpack how his company scaled from 3D rendering and gaming to delivering GPU cloud services for AI workloads across the globe. We explore how Arkane built a 1,000-GPU cluster in Paris (with capacity for 6,000), the rise of inference workloads in Europe, and the real-world engineering and business challenges of deploying high-density infrastructure - including cutting-edge liquid cooling handling 135kW per cabinet.

Puppet Infra Assistant: AI-Powered Natural Language Queries

Finding critical infrastructure insights shouldn't be a game of hide-and-seek. The new AI-powered Infra Assistant is a natural language interface that allows users of any skill level to chat with Puppet data and services for quick insights and reporting on infrastructure state. You don't need any Puppet experience to get started; it's safe to use in your infrastructure; and it's secured with explicit opt-in and robust role-based access control.

Setup Guide: Infra Assistant for Natural Language Puppet Queries

Learn how to get started with Puppet Infra Assistant quickly with a step-by-step guide and examples. Infra Assistant makes it easier to interact with Puppet data and services with a natural language interface. It's "bring your own key" with support for OpenAI and Azure OpenAI (with more coming soon), and secured with opt-in requirements and role-based access control.

6x Developer Velocity: Intuit's Secret to Unlocking Innovation

Join Jimil Patel, Head of Technical Product Marketing and Developer Advocacy at Intuit, as he shares the company's transformative journey from cloud-native to AI-native, resulting in a 6x increase in developer velocity. Recorded at Civo Navigate Austin 2025, this talk explores Intuit's Modern SaaS AIR platform, AI-powered developer tools, intelligent auto-scaling, and AIOps-driven operations. Discover how Intuit is redefining the future of software development with real-world examples, including IKS AIR for self-healing runtimes and AI-driven observability, which cut MTTR by 50%.

Security and Compliance Takes Center Stage: Key Insights from Open Source Finance Forum - London 2025

We’ve just wrapped up London’s 2025 Open Source Finance Forum (OSFF) in London and in this blog I’ll try to capture the key highlights from this year’s event while they’re still fresh. Dominant themes were the increasing prominence of legislation and governance frameworks, and what these mean for developers and practitioners.

Is it time to switch CI/CD platforms? 7 warning signs

Every engineering team eventually faces this question: “Is our CI/CD setup actually helping us, or is it getting in the way?” The answer isn’t always obvious. CI/CD problems often develop gradually: small issues become accepted workarounds, and those workarounds become standard practice. What once worked well for your team might not fit your current needs or scale. The decision to evaluate new tooling usually builds over time as pain points accumulate and priorities shift.

32 Best FinOps Tools For 2025: Features And Comparison

In recent years, cloud financial management has evolved beyond what many cloud stakeholders anticipated. The overwhelm has led too many companies to struggle to accurately monitor, allocate, and optimize their cloud costs. This issue cost companies about 30% of their cloud budgets in 2022 alone, according to Gartner. With FinOps, you can prevent this bleeding without sacrificing innovation. Yet, taking a manual approach to FinOps can be inefficient and error-prone.

#046 - Simulating, Scheduling, and Saving: Optimizing Kubernetes with David Morrison (Applied Res...

In this episode, Itiel has an insightful conversation with Dr. David Morrison, a research scientist and founder specializing in Kubernetes scheduling and autoscaling. David shares his journey from operations research to leading distributed systems efforts at tech giants like Yelp and Airbnb. Learn about the transition from Apache Mesos to Kubernetes at Yelp, including the role of their open-source API layer, Pasta.

Kubernetes Costs: More Than Meets The Eye

As organizations expand their Kubernetes deployments and scale production workloads, effective cost management becomes an essential priority. The rapid innovation demanded from development teams often intersects with a shortage of advanced Kubernetes expertise, leading to resource inefficiencies and unnecessary expenses. This challenge is further amplified by the growing prevalence of AI/ML workloads and the intricate demands of GPU utilization.

How to Deploy Helm Charts on Kubernetes the Easy Way with Qovery

Deploying Helm charts on Kubernetes can be complex, especially when dealing with configuration overrides, security, and environment-specific setups. In this article, we show how Qovery simplifies Helm chart deployment through a seamless developer experience, robust security defaults, and powerful automation, without sacrificing flexibility.

Amazon SQS Metrics: Monitor, Debug, and Optimize Your Message Queues

Message queues quietly take care of a lot—buffering workloads, smoothing traffic spikes, and keeping services connected. But they don’t always get much attention until something feels off. Amazon SQS offers a solid set of metrics to help you understand how your queues are doing, whether you’re scaling well or nearing limits. This blog breaks down the key SQS metrics: where to find them, what they mean, and how to respond when things start to shift.

How to Configure Docker's Shared Memory Size (/dev/shm)

Your Node.js app runs fine on your machine. But inside Docker? You start getting weird crashes—ENOSPC: no space left on device. Chrome headless tests fail out of nowhere. PostgreSQL throws shared memory errors under load. The problem? It’s probably /dev/shm, the shared memory volume Docker sets up by default. Most containers get just 64MB of space here.

Windows VPS Hosting in Practice: Insights from the Kamatera Platform

When diving into the world of Windows VPS hosting, it's essential to choose a provider that offers robust performance, flexibility, and reliability. Kamatera stands out as a dominant player in this domain, providing an expansive suite of features tailored to enhance user experiences and system capabilities. By harnessing the full potential of Kamatera's platform, you can streamline your digital operations, ensuring they are both efficient and effective.

11 Best AI Coding Assistants: Top Tools Every Developer Needs in 2025

You’ve heard it before: ‘AI coding assistants aren’t here to replace you.’ And yes, it’s true, they’re not. They’re here to save your brain from 3 AM logic loops and the same bug fixes you’ve solved countless times. As application and database systems become more complex and timelines shrink, forward-thinking developers, data analysts, and DBAs are turning to these tools.

Salesforce Data Integration Tools

Salesforce integration tools are now foundational to enterprise performance. With the average company managing over 1,000 apps, Salesforce must operate as a unified control layer that governs data, workflows, and customer intelligence across the stack. However, in reality most teams never realize this potential. According to Salesforce, 95% of IT leaders face integration challenges, and more than half say these issues directly block them from meeting customer expectations.

Best HubSpot Connector Apps

HubSpot connectors are the backbone of modern CRM operations, marketing, and sales. These powerful tools let you break down data silos, sync information across platforms, and build automated workflows. Whether you’re connecting databases, analytics tools, or custom-built systems, HubSpot connector apps can help you get more out of every lead, campaign, and customer interaction. As great as these tools are, not all will work for you.

Future-Proofing Government IT: Balancing Innovation, Security, and Sovereignty in a Changing World

Get a practical look at how Australian government IT leaders can future-proof infrastructure with flexible, sovereign-aligned connectivity. This blog was originally published on PublicSectorNetwork.com.au on 11th June 2025 and republished with permission. As digital transformation accelerates across the public sector, Australian government agencies face a complex challenge: how to modernize IT infrastructure while safeguarding sovereignty, strengthening security, and maintaining compliance.

Building a Better Cloud: Inside Civo's Vision for What Comes Next

Recorded live at Civo Navigate Austin 2025, Civo CTO Dinesh Majrekar explores how cloud infrastructure is evolving to meet the demands of modern workloads. From rising AI adoption to the need for data sovereignty and cost transparency, Dinesh shares Civo’s vision for a simpler, more efficient, and developer-focused cloud. Learn how Civo is addressing customer challenges around choice, control, and performance and why rethinking how we build and deliver cloud infrastructure is more relevant than ever.

11 Best Log Monitoring Tools for Developers in 2025

Your checkout API just started throwing 500s during peak traffic. You SSH into production, tail logs across six microservices, and realize the database timeout buried in service's logs is causing cascade failures. Two hours later, you've fixed it, but you're thinking: "There has to be a better way." There is. Log monitoring tools centralize logs from your entire stack, making debugging systematic instead of archaeological.

Grafana Cloud: Manage the AWS Observability app as code with Terraform

Imagine setting up your AWS configuration in Grafana Cloud by hand and clicking through menus. When you only have a few services, it’s not a big deal. But as you add more and more, keeping track of every little change becomes a headache. It’s easy to make mistakes, and before you know it, things can get out of sync and your monitoring becomes unreliable.

What's really changing in Microsoft Licensing - A real talk with Alexander Golev

In this episode of the FinOps on Azure podcast, host Michael Stephenson is joined by Alexander Golev from SAM Expert for an honest conversation about what's really changing in Microsoft licensing. With decades of experience in software asset management, Alexander breaks down the shift from traditional licensing to more flexible, consumption-based models -highlighting the role of Azure Arc, the growing complexity of AI licensing, and the importance of collaboration across IT, finance, and procurement.

8 GKE Monitoring Best Practices For Peak Performance

Kubernetes (K8s) is the most popular container orchestration platform today. But it can also be quite complex. To overcome this management challenge, you can deploy your Kubernetes containers using the Google Kubernetes Engine, which is a fully managed service. Yet, to get the most from GKE, you still need to follow best practices. The following tips and best practices for monitoring GKE clusters will help you get started.

Unlocking Cost Optimization Through Full-Stack Kubernetes Visibility

In Kubernetes environments, cost is rarely just about spend. It’s about performance, node utilization, workload behavior, and how all of those align with your team’s operational goals. Komodor’s approach to cost optimization has an operational advantage due to its deep visibility into your entire Kubernetes estate. Imagine the potential for cost optimization when you have complete visibility into every aspect of your Kubernetes operations.

5 Use Cases Requiring Transformative AIOps Tools

As infrastructure grows in complexity, the demand for intelligent, autonomous operations is no longer optional. AIOps tools help IT organizations sift through data noise and detect what matters. But insights alone don’t close the loop. Without automation, even the most accurate prediction still relies on humans to triage, decide, and resolve. This is the gap intelligent automation fills.

The Benefits of Using Juniper's Network Monitoring Tools for IT Operations

More data means more complexities in IT networks. Hence, the right solution is needed to monitor such networks. Many companies struggle without the right tools, and they often lose great business opportunities because they are unable to identify performance-related issues upfront. Network monitoring is thus essential for business success. It helps build healthy network performance, saving companies money in the long run.

A Beginner's Guide To Amazon RDS Pricing

Amazon promotes its RDS as a scalable, high-performing alternative to traditional databases, backed by automation and reliability. It’s popular among teams looking to offload maintenance and improve availability. But while the service is robust, costs can rise quickly depending on how it’s used. This guide explains how Amazon RDS pricing works. In addition, we’ll discuss how to understand, optimize, and view your RDS costs. But first…

Revolutionizing Web Page Creation: How Structured Content is Slashing Design and Development Time

Co-authored with Julie Muzina A year ago, during our Madrid Engineering Sprint, we challenged ourselves to dramatically reduce, or even eliminate, the need for constant design involvement in the day-to-day creation of web pages. Our strategy for achieving this is based on a smarter, more structured approach to content.

Unlocking Developer Productivity: SUSE Application Collection extension for Rancher Desktop

Same as in the community, Enterprise developers need tools that are both powerful and flexible. They need to innovate quickly, iterate efficiently‌ and deploy with confidence. This is where the synergy between Rancher Desktop and SUSE Application Collection truly shines, offering a comprehensive environment for modern enterprise developers.

What Are the Key Benefits of Using DCIM vs. Traditional Tools?

Historically, data center professionals have managed their sites using traditional tools like Excel and Visio. While manual spreadsheets and diagrams served their purpose for simple tasks, they were never designed for the complexity, scale, or speed of modern data center operations. However, Data Center Infrastructure Management (DCIM) software is purpose-built to plan, provision, model, track, and monitor all infrastructure across all sites.

Is Your Data Truly Yours? Why Data Sovereignty in India Matters More Than Ever

As businesses in India embrace the cloud, a critical question looms: Where does your data really live, and who controls it? In 2025 alone, India’s cloud market is projected to reach US$ 21.4 billion, with further growth in 2030 expected to reach US$ 52.2 billion. This helps to underscore the rapidly expanding scale and strategic importance of cloud infrastructure in the country. But with this growth comes growing concern: Is your data secure, compliant, and under your control within Indian borders?

Commit Code, Embrace Vulnerability, Ask Questions, Grow Together

Think they’ll find out you don’t know everything? Good. That means you’re learning. Every great dev was once a beginner who asked the right questions. This clip from our chat with Descript’s Corbin Crutchley and our own Chris G hits hard. GitKraken Desktop: gitkraken.com/git-client GitKraken CLI: gitkraken.com/cli GitLens for VS Code: gitkraken.com/gitlens Git Integration for Jira: gitkraken.com/git-integration-for-jira.

Watch RITA (Resolve IT Agent) fix VPN issues in seconds! #itautomation #ai #agenticai

Say goodbye to ticket chaos. Meet RITA. RITA (Resolve IT Agent) is your intelligent frontline assistant—built to deflect L1 tickets, resolve routine requests instantly, and slash MTTR across the board. In this live demo clip, watch Derek Pascarella, our Global Director of Sales Engineering, show how RITA fixes VPN issues in seconds—no human handoff, no ticket backlog. Ready to fast-track your journey to Zero Ticket IT? This is where it starts.

Insights to keep AI applications reliable

AI has become a massive investment for companies. Engineering teams across industries are integrating AI into their products, whether it’s through homegrown, self-managed models or third-party model integrations. But no matter how much AI shifts the user experience, it’s still an application, which means your engineering team still needs to operate it and keep it reliable. At the same time, AI applications add complexity and complications that require a shift in your approach.

Understanding Playwright test hooks in the CI context (JavaScript) - A complete tutorial

All applications need some form of testing, whether frontend, backend, stress testing, or any other. Playwright can help. Playwright is an end-to-end testing framework for web applications, supporting cross-browser testing (Chromium, Firefox, WebKit) from a single API. Its built-in test runner (Playwright Test) provides hook functions to manage set-up and tear-down logic around your tests.

Validating OS-compatibility for locally-run LLMs using Ollama with CI/CD matrix workflows

Large Language Models (LLMs) are becoming increasingly accessible, with regular adoption of open-source models and the growing ecosystem of tools for running them locally. Compact versions are now able to run on consumer-grade hardware, so developers are using LLMs on personal devices like Linux workstations, macOS laptops, or even Windows machines. As this trend grows, so does the need to ensure that your LLM-powered applications run reliably across all major operating systems.

Evolving deployments in Bitbucket Pipelines: Concurrency Groups and Environments

We’re excited to announce that Bitbucket Cloud is introducing two powerful new features in Bitbucket Pipelines: Concurrency Groups and Environments. These enhancements are part of a broader initiative to make the Deployments functionality more flexible and user-friendly by breaking down its current monolithic structure into smaller, more granular capabilities that you can control directly.

The Future of Auditing is Agentic AI

There is a huge amount of hype around AI. Companies are growing faster than ever, IT budgets are being redirected, and product roadmaps everywhere are being redrawn. There is no doubt LLM’s are a transformative technology. At the same time, as with any early technology cycle we are far from understanding the patterns of success. And for sure, mis-steps and bad takes abound.

Top Tableau Database Connectors in 2025

Tableau database connectors are changing the narrative of how to work with data from your dashboard. A few years ago, the ability to pull data from almost any source, including your company’s internal database or even a web API, and immediately convert it into powerful, interactive dashboards sounded too good to be true. Today, with Tableau database connectors, you can pull data from virtually any data source and convert it instantly into a powerful dashboard.

AWS FinOps: 15 Tools For Cost Visibility And Control

AWS remains the largest cloud service provider (CSP) of the 21st century. It also provides over 240 cloud-based products and services. In some cases, these services help customers like you collect, analyze, and act on data about cloud usage and related costs. In this post, we explore how AWS services support FinOps’ best practices, including the features they offer. If you are looking for even more robust AWS FinOps tools, we will also include third-party platforms.

GitLens 17.2: Commit Composer Preview, Streamlined UX, and Enterprise AI Controls

GitLens 17.2 is here with a comprehensive set of improvements designed to enhance how you work with Git. This release introduces Commit Composer – AI-powered commit organization, refines the Home View experience based on your feedback, expands AI model support, and delivers enterprise-grade security controls for teams using AI features. Let’s explore what’s new and how these features can improve your development workflow.

OWASP CI/CD Part 8: Ungoverned Usage of 3rd Party Services

The boundaries of what organizations build internally and what they adopt externally have blurred. Developers routinely integrate third-party services into critical CI/CD pipelines, often with minimal friction and limited oversight. This rapid plug-and-play convenience, while key to modern engineering velocity, is also quietly expanding the attack surface in ways many teams struggle to track - let alone govern.

Adding AI to applications using the Model Context Protocol

Large Language Models (LLMs) are now at the cutting edge of mainstream AI systems. Their impact has been seismic, sparking a new gold rush as application developers transform the user experience away from clicks and commands into natural language and advanced automation. However, application developers have a barrier to overcome. AI models need data to reason and respond to a particular application domain.

Effective infrastructure automation to reduce data center costs

Today, managing a data center requires striking a balance between cost, security, and performance. Long-term costs are a different matter, even though upfront capital expenditures (CapEx) like real estate and hardware are well-known and reasonably predictable. According to industry surveys, operational expenses (OpEx), which include system provisioning, patching, compliance, and troubleshooting, steadily increase over time and frequently exceed 50% of total cost of ownership (TCO) by the third year.

Prometheus Logging Explained for Developers

Running apps in production? You need visibility fast. Traditional logging gives you scattered events. Prometheus gives you structured, queryable data that scales. In this guide, we’ll break down how to use Prometheus for logging-style observability, where it fits in your stack, and how to plug it into tools like Grafana or your cloud-native setup.

Risk and the problems of 3rd party software dependencies

Docker's VP of Product, Michael Donovan, discusses the importance of risk management and the security challenges introduced by the scale of 3rd party software dependency in development. See the full webinar: https:/cloudsmith.com/webinars Get to know Cloudsmith: About Cloudsmith We offer the world's best cloud-native artifact management platform to control, secure, and distribute everything that flows through your software supply chain. Cloudsmith operates at enterprise scale, reduces risk, and streamlines builds.

Using a Kubernetes credential provider with Cloudsmith

Join Ian Duffy, Senior Site Reliability Engineer at Cloudsmith, as he discusses using credential providers in Kubernetes to securely pull images from private repositories. Credential providers are a great new feature that appeared in recent versions of Kubernetes. They allow you to pull images using a short-lived authentication token, which makes them less prone to leakage than long-lived credentials - bolstering security in the software supply chain.

Docker Stop vs Kill: When to Use Each Command

When a container starts consuming excessive memory or becomes unresponsive, you need a way to shut it down. The two primary options — docker stop and docker kill,both terminate containers, but they operate differently and have different implications. The key difference: docker stop sends SIGTERM for a graceful shutdown, then escalates to SIGKILL if the process doesn’t exit in time. docker kill skips straight to SIGKILL, terminating the container immediately.

Rollbacks, Red Eyes And Unreliable Deployments

We spoke to data professionals from a range of industries about the impact of unreliable database deployments — not just on their systems, but on their workload, time, and well-being. From delayed releases to weekend firefighting, and the fallout for teams and customers, they share the day-to-day pressures they face and the small changes that help make deployments, and life, a little less stressful. What stood out from these conversations?

Smarter Data Center Capacity Planning for AI Innovation

Global demand for data center capacity is skyrocketing. From 2023 to 2030, power consumption across data centers is expected to grow by up to 22% annually, driven primarily by generative AI (GenAI) workloads. By 2030, AI workloads are predicted to account for 70% of total demand. This demand doesn’t just mean more hardware; it necessitates high-density computing environments to support training large language models like GPT and real-time inference systems.

GenAI: 80% Adoption by 2026... Are You Ready?

In this video, we explore the growing adoption of Generative AI in enterprise, the common pitfalls companies face, and how to build GenAI infrastructure that’s secure, scalable, and production-ready. We also introduce how relaxAI, Civo’s AI assistant, helps solve key challenges around privacy and infrastructure, giving you full control by bringing the LLM to your data.

Goodbye imagePullSecrets, Hello Kubernetes Credential Providers

Previously, we showed you how to securely pull Docker images from Cloudsmith to Kubernetes using OIDC with a CronJob-based approach. We concluded the post discussing credential provider plugins from Kubernetes 1.20 and an enhancement in Kubernetes 1.33 that offers a new approach for external registries like Cloudsmith. We have now built a credential provider that takes advantage of this new capability. This article explores what this means for the future of pulling images from Cloudsmith on Kubernetes.

Entity Developer Overview: Visual ORM Designer for .NET

Entity Developer is a powerful visual ORM designer for.NET that helps you build, edit, and manage your data models faster and more efficiently. Whether you're using EF Core, NHibernate, LinqConnect, or classic Entity Framework, Entity Developer streamlines the process with rich design tools, customizable templates, and full integration with Visual Studio. In this video, we’ll walk you through the key features and show how you can.

Managed IT Services in Mississauga: Reliable Solutions

In today's fast-paced business environment, having reliable IT support is crucial for success. Companies in Mississauga are no exception, as they face unique challenges in maintaining their technology infrastructure. With the increasing demand for efficient and secure IT systems, businesses are turning to managed IT services to stay ahead of the curve.

Fewer Bindings, More Power: Rancher's RBAC Boost for Enhanced Performance and Scalability

Managing permissions in sprawling Kubernetes landscapes can often feel like untangling an ever-growing knot. As clusters and user bases expand, so does the intricate web of RoleBindings, impacting everything from UI responsiveness to the very stability of etcd. This complexity, if unaddressed, can become a significant hurdle to achieving scalability and maintaining optimal performance in Rancher. SUSE is committed to improving its container management platform.

Engineering Excellence Summits Recap

The best engineering teams ship quality software quickly, but doing that consistently requires more than just speed. It requires careful attention to reliability, security, ease of maintenance, and developer experience. The Engineering Excellence Summits were designed to create a community of engineering leaders looking to connect with others facing similar challenges, share approaches that are working, and learn what “better” can look like.

Platform engineering with a product-management mindset

To really make an impact, platform engineering teams need to start thinking like product managers. That means deeply understanding their users, measuring outcomes instead of outputs, and tying everything they do to real business value. Organizations who care about total cost of ownership and fast time to value are adopting this mindset.

Securing AI with AI-SPM: The Next Step in AI Risk Management

The conversations around artificial intelligence (AI) typically revolve around its vast potential: writing applications, automating tasks, or transforming entire industries. However, despite the excitement around AI’s potential, the more pressing issue for many organizations is how to manage the risks of deploying it at scale across the enterprise. This is where AI Security Posture Management (AI-SPM) comes into play.

What we learned from load testing Shopware at scale

We ran real-world load tests across seven different infrastructure plans—from Grid to Dedicated Split—using realistic conversion rates, bot traffic blends, and ERP-driven API imports. The findings were clear: performance scales predictably with resources, but only if your code, cache, and configuration keep up. This blog post walks through key results, why API load is disproportionately expensive, and what metrics matter most. How well does Shopware actually perform under load?

On-Call Schedules: Everything You Need to Know

I use Slack daily. It works perfectly fine. Outages rarely happen. Even if they happen, they are resolved quickly. And this is the same for many other tools. But how are they all doing it—Keeping services running and resolving issues quickly? The secret: On-Call Schedules. On-call schedules make sure someone is always available to handle emergencies, so your systems stay reliable.

DevEx Unpacked 006 - Leadership, Scaling & Serving Developers with Glenn Weinstein

Episode 006: In this episode of DevEx Unpacked, Cloudsmith co-founder Alan Carson sits down with CEO Glenn Weinstein for a deep dive into leadership, growth, and developer-first thinking. Glenn shares his journey from programming on a Commodore PET to founding and selling a startup, his lessons from Twilio, and what drew him to lead Cloudsmith. The two discuss what it takes to build a category-defining company from Belfast, navigating VC funding, and how values like resilience, clarity, and service drive long-term success.

The Real Cost of DIY Infrastructure Management vs. Enterprise-Ready Solutions

Many IT teams underestimate the true cost of managing infrastructure themselves. At first glance, DIY tools may seem like a cost-effective and flexible solution — but the workflows you build and manage with in-house tooling reveal a host of hidden expenses, inefficiencies, and risks as your IT scales. While it’s not a new problem, it’s one that’s revealing itself more and more clearly as time goes on.

Log Management and Query Optimization in Kibana

When troubleshooting with the Elastic Stack, Kibana is often the interface you’ll rely on to query and visualize logs. It doesn’t change the data—it just makes it searchable and a bit easier to work with under pressure. If you’re investigating an outage, tracking performance issues, or trying to correlate events across services, Kibana’s log exploration tools can speed up the process, assuming they’re configured and used well.

What is Container Orchestration

In the simplest of terms, container orchestration is the automated process of deploying, managing, scaling and networking containers. Containers are lightweight, portable self contained units that include an application or the processes needed to run applications. Docker is a great example of a project that helps to containerize or package applications, and was a large reason why containers gained such popularity around 2013. Before Docker there were Linux Containers (LXC).

Telstra Programmable Network Is Being Discontinued. Here's How to Migrate

Learn how you can use Megaport to successfully migrate from TPN. Telstra has announced that its Telstra Programmable Network (TPN) service will be fully retired by January 2026, with key service milestones beginning as early as July 2025. But for teams that rely on TPN, this change doesn’t have to disrupt operations.

Infra Assistant in Puppet: Talk to Your Infrastructure with Natural Language

Improve productivity and get critical insights faster — no Puppet experience required. Infra Assistant is a natural language interface for interacting with Puppet data and services, saving time and making critical insights available faster than ever. Powered by Perforce Intelligence, Infra Assistant helps teams increase efficiency, reduce manual effort, eliminate bottlenecks, and unlock the goldmine of data on your infrastructure — all while maintaining enterprise-grade security and compliance.

AWS Vs. GCP: Which Platform Offers Better Pricing?

Although Amazon Web Services (AWS) still holds about a third of the cloud services market, Google Cloud Platform (GCP) has grown rapidly in recent years. Among the reasons for GCP’s rapid adoption is its simpler pricing structure compared to Amazon Web Services and Microsoft Azure. Is that really the case? And how do AWS and GCP pricing differ? But first, a quick background.

Access Logs: Format Specification and Practical Usage

Your server's been logging everything—it’s just easy to overlook until something breaks. Every incoming request, database call, or auth check ends up in your access logs. They’re not flashy, but they quietly document every interaction your system handles. For developers, they’re often the most reliable starting point when things go wrong. In this blog, we'll take a look at what an access log is, its format, types, and a few best practices.

AI is now writing code at scale - but who's checking it?

As Generative AI (GenAI) reshapes the software development landscape, the risks and complexities around managing what gets built, where it comes from, and how it’s secured are growing just as fast. The Cloudsmith 2025 Artifact Management Report dives into this shift, offering critical insights into how teams are adapting their infrastructure and software supply chain security practices in response to the AI-generated code.

Infrastructure Management: Containers vs Virtual Machines

Trends in tech come and go, but certain underlying primitives stick around forever. In software, two such primitives are virtual machines and containers. Virtualization paved the way for the cloud to become massive. Data centers would likely never have been commercially viable without it. While still relatively new, containerization has already made a serious mark on the software engineering world.

GitKraken Desktop 11.2: Merge Conflicts, Meet AI (and More Dev-Quality-of-Life Wins)

We’ve been steadily building something powerful into GitKraken: AI that understands your code and your context. In recent releases, GitKraken AI has already helped you: Now, in version 11.2, it’s tackling one of the most frustrating parts of your day: merge conflicts.

Azure CDN for Static Assets, APIs, and Front Door

If your users are spread across the globe but your servers are sitting in Virginia, you’ll probably hear complaints about slow load times, especially from places like Australia. CDNs fix this by caching static assets closer to where your users are. Azure CDN does exactly that, and it fits well if you're already using Azure services. You can hook it up to Blob Storage, App Services, or your origin. This guide covers how to set it up, what to expect, and how to know it’s working.

How to Build a Zero Ticket Service Catalog with IT Service Desk Automation

In a traditional IT environment, the service catalog has long functioned as a directory—an online menu of things users can ask IT for. Need a new laptop? Submit a ticket. Need Salesforce access? File a request. Every need, every problem, every question gets funneled into a form, queued up for manual processing, and eventually (and hopefully) resolved. But this static, ticket-heavy model can’t keep up with the pace of business today. Employees expect seamless, self-service experiences.

Hyperview DCIM vs. Nlyte DCIM: Which Software is Right for You?

Hyperview stands out with its transparent and flexible subscription-based pricing. Being a cloud-based solution, it eliminates hefty upfront costs often associated with traditional DCIM software. Updates and upgrades are rolled out seamlessly with zero downtime, and there are no hidden fees, making it highly budget-friendly for businesses looking to control long-term costs.

The DigitalBridge Blueprint: Marc Ganzi on Power, Capital, and the AI Revolution | Uplink | Ep.10

What fuels the AI revolution, cloud expansion, and always-on connectivity? Infrastructure. In this episode of Uplink, host Michael Reid sits down with Marc Ganzi, CEO of DigitalBridge, to explore the rise of the digital infrastructure economy. With $100B+ in assets under management, Marc and his team are building the physical foundations of the internet—from hyperscale data centers to mobile towers and subsea cables.

How SaaS Companies Can Profitably Price AI Agents

AI agents are undoubtedly exciting. What company would turn down the opportunity to use an intelligent bot that can make decisions, perform complex tasks, and take on the tedious work employees would normally have to deal with? On paper, AI agents sound like tremendous time and money-savers. The reality, however, is a bit different from the fantasy.

CVE-2025-3248: Serious vulnerability found in popular Python AI package

Researchers at Trend Micro have uncovered a critical unauthenticated remote code execution (RCE) vulnerability affecting Langflow versions prior to 1.3.0. Langflow is a Python-based visual framework for building AI applications and boasts over 70,000 stars on GitHub and over 21,000 global weekly downloads from the public PyPI upstream. Source: Cloudsmith Navigator Versions released before 1.3.0 contain a serious flaw in the code validation logic, which allows arbitrary code execution.

Are You Correctly Deploying LLMs on Kubernetes in 2025?

We are in mid-2025, and teams across industries are rolling out large language models, or LLMs, to power everything from conversational agents to document understanding. However, getting them to run smoothly in production… That’s still a challenge. A working model isn’t just about putting it in a container and tossing it into a Kubernetes cluster.

GitKraken Desktop 11.2: AI Conflict Resolution, Explained (Preview)

In GitKraken Desktop 11.2, we're introducing AI Conflict Resolution, now in Preview. This update lets GitKraken AI suggest context-aware merge resolutions and explain its decisions, helping you resolve conflicts faster and with more confidence. We've also added hunk reverts, avatar support in the Commit Graph, and improvements to Commit Explain. Timestamps: Watch the full walkthrough, share your feedback, and stay tuned. The future of Git tooling is getting smarter.

Solving Transportation & Logistics Device Management Woes With AirDroid Business

With features like Kiosk Mode, automated tasks, and robust reporting, managing thousands of devices has never been easier. Learn how to lock devices to essential apps, automate routine maintenance, and gain insights into device usage—all in one powerful solution. Don't let device management hold you back; transform your operations with AirDroid Business.

Meet dbForge AI Assistant: Your Invaluable Sidekick That Reinvents SQL Coding

If writing SQL code is part of your daily work, you won’t be surprised by things like context-aware code completion, as-you-type syntax validation, and debugging. All of them are designed to enhance and speed up your SQL coding. But what if we said that you could completely reshape your experience with an AI-powered assistant? What if you could have your error-free queries generated, analyzed, and optimized in just a couple of moments?

20 Azure Cost Management Tools For Cloud Savings

Toward the end of Q1 2022, survey findings reported that Microsoft’s Azure cloud computing services had, for the first time, eclipsed Amazon Web Services (AWS) in some enterprise categories. According to the respondents, more enterprises preferred Azure because it integrates well with the many Microsoft products they already use. A second reason is that Azure is suitable for running on-premises and at the edge. Some organizations also use Microsoft Azure to avoid vendor lock-in to AWS.

Introducing Netdata Insights

Now in research preview: Netdata Insights The problem: Incident? You're jumping between dashboards, piecing together timelines. Reporting? You're copy-pasting charts and correlating trends by hand. The data’s there, but turning it into a narrative doesn’t scale. The solution: Netdata Insights. Synthesizes our high-fidelity telemetry using the latest LLMs into AI-powered reports with natural-language explanations, visuals, and clear recommendations.

2025 - The Year of Data Repatriation

For many businesses, 2020 marked the dawn of the cloud-first era, with organisations around the world embracing public cloud. And it made sense at the time; promise of reduced infrastructure costs, flexibility and scalability meant that leveraging cloud services was a no-brainer. But with any new technology, the shifting tides that come along with its proliferation also informs the cyclical nature of its adoption.

OWASP CI/CD Part 7: Insecure System Configuration

Insecure system configuration is a textbook example of how neglected settings can create an entry point for attackers targeting your CI/CD pipelines. It’s rarely the cutting-edge zero-day that causes a breach. More often, it’s the unpatched service, the overly permissive role, or the default password that was never changed. While this risk overlaps with CI/CD credential hygiene (covered in Part 6 of our OWASP CI/CD series), the focus here is much broader.

Announcing Qovery Observability: the simplest way to understand your application

We are thrilled to announce the next major milestone in our platform vision: Qovery observability! Qovery Observability is our new product, ready to give you the fastest way to gain a crystal-clear, unified understanding of your application and infrastructure. Fully managed, zero lock-in, you keep the data. Devs love it, no DevOps needed. Coming soon!

DevEx Unpacked 005 - Secure DevOps, Rego Policies & Growing Cloudsmith with Ciara Carey

Episode 005: In this episode of DevEx Unpacked, Alan Carson chats with Ciara Carey, Solutions Engineer at Cloudsmith, about her career journey from developer to DevRel to her current customer-facing role. Ciara shares real-world insights on software supply chain security, how teams are using Enterprise Policy Management (EPM) to control open source risk, and why Cloudsmith’s cloud-native platform is a game changer for DevSecOps workflows.

Kubernetes sidecar deployment using CircleCI

Kubernetes excels at managing complex, containerized systems, and one of its most impactful patterns is the sidecar. Sidecar containers extend applications by running supplementary processes in tandem. This modular architecture enables enhanced observability, networking, or security layers — all without changing the core application code. Continuous Integration and Continuous Deployment (CI/CD) practices are key to reliably shipping these configurations.

2025 Cloud Pricing Comparison: An In-Depth Guide

Over $44.5 billion in cloud spend goes to waste annually, per the FinOps Foundation. No wonder reducing unnecessary costs is critical to protecting your margins. A logical place to start? Cloud service pricing. Providers like AWS, Azure, and Google Cloud continue to evolve their pricing models. They are offering new discounts, regional rates, and shifting commitments. All to win your business. Yet, a cloud pricing comparison alone doesn’t give you a complete picture.

How to test your systems for scalability and redundancy with fault injection

Part of the Gremlin Office Hours series: A monthly deep dive with Gremlin experts. Do you know if your services can tolerate losing a node? What about an entire availability zone? Or a region? Large-scale outages aren’t unheard of. When you’re running critical services, it’s vital that those services can keep running even if an AZ or region fails. In addition to failing over, these services also need to scale quickly so traffic shifts don’t overwhelm your systems. How do you prove that a service is both scalable and redundant? The answer is with Fault Injection.

How to be prepared for cloud provider outages

GCP’s recent outage on June 12th was a reminder of just how interconnected modern architectures are. The 2 hour and 28 minute outage affected dozens of companies and spanned 80+ Google services and products. But what was really illuminating was just how far the outage spread due to hidden dependency risks. Many companies that don’t run on GCP were startled to find their services suddenly affected because they had dependencies or depended on vendors that did use GCP.

Automating machine learning security checks using CI/CD

Machine learning (ML) pipelines are increasingly being treated like software; built, tested, deployed, and monitored using automated tooling. But while infrastructure as code and microservices have matured with security best practices, ML systems often lag behind. The truth is, your ML pipeline is part of your software supply chain and it is vulnerable.

Build an AI-powered Golang code review agent with CircleCI and GitHub webhooks

Code reviews are a crucial step in maintaining code quality, but many developers find them tedious and inconsistent. What if you could get helpful feedback automatically, as soon as a pull request is opened? In this tutorial, you’ll learn how to set up and integrate an AI-powered code review agent into your Go project. The agent uses the OpenAI API to post contextual suggestions and praise directly on pull requests.

Everything You Need to Know About Event Logs

Your code passes locally, CI is green, and the deploy goes through. Then production throws a 500, and the trace isn’t helpful. And here, event logs help. A log captures timestamped records of what the app did HTTP requests, DB queries, cache misses, retries, failures. These entries give you enough context to debug without reproducing the issue locally. Especially when dealing with distributed systems, logs are often the only consistent source of truth.

Our Golang Stack in 2025

In our Go projects, we rely on a consistent and battle-tested stack of libraries that help us build reliable, maintainable, and scalable systems. We started using Go in our stack many years ago (before Go v1) and therefore many of our choices have changed over the years. Here in this post, I wanted to share some of the libraries we use regularly to power our Go apps.

Anbox Cloud 1.26.0: what's new?

In this video, Anbox team covers new features and changes in Anbox Cloud 1.26.0 release: Deployment and operations Instance management Logging Dashboard enhancements Images Streaming CVEs What is Anbox Cloud? Anbox Cloud lets you run virtualized Android environments securely, at any scale, to any device letting you focus on your use case. Run Android in system containers, not emulators, on AWS, OCI, Azure, GCP or your private cloud with ultra low streaming latency.

An Introduction to Bitbucket Pipelines

This video provides a 90-second introduction to Bitbucket Pipelines to help you get started with CI/CD in Bitbucket. Bitbucket pipes, which are reusable pipeline steps, are introduced. About Atlassian: Behind every great human achievement, there is a team. From medicine and space travel to disaster response and pizza deliveries, we help teams all over the planet advance humanity through the power of software. Our mission is to help unleash the potential of every team.

DevEx Unpacked 004 - Scaling Startups, Blockchain & Developer Culture with Jack Spargo

Episode 004: In this episode of DevEx Unpacked, Alan Carson chats with Jack Spargo, CTO of Control Alt, about his fascinating career journey from aerospace engineering to leading blockchain-powered investment platforms. Jack shares lessons from being acquired overnight, the challenges of building a platform from scratch, and why he’s betting big on junior engineers and AI augmentation. They explore the realities of compliance, software supply chain security, and why Northern Ireland is fast becoming a serious start-up hub.

Verizon Discusses Network Transformation at Ribbon Insights

In a recent presentation, Verizon’s Steve Ownes discussed their strategic initiative to accelerate the decomissioning (decom) of TDM switches, underlining the significance of repurposing legacy infrastructures in favor of modern architectures. Ribbon’s guest Steve Owens kicks things off with a light-hearted reference to "Sanford and Son” showing how relics can be transformed into gold through effective management and innovation.

Could your Palo Alto firewall do more to protect you against Shadow AI?

In recent months, my conversations with fellow technology leaders have consistently revolved around two key themes: how we leverage AI to drive innovation and efficiency, and how we mitigate the inherent risks associated with AI. However, I’ve noticed a concerning gap – while enterprises are busy strategizing the adoption of AI to enhance productivity, reduce costs, and outpace competitors, very few are addressing how AI is being actively used today by their own teams.

Fluent Bit Helm Chart: Simplify Log Collection in Kubernetes

Collecting logs in Kubernetes often starts as a simple goal, and quickly turns into a game of “where did that log line go?” Between sidecars, DaemonSets, and countless config options, it’s easy to get lost. Fluent Bit helps cut through the noise. It's fast, lightweight, and plays well with Kubernetes. And when you deploy it using Helm charts? The setup becomes way more manageable. This guide covers the how and the why, without overcomplicating the what.

Datadog + OpenAI: Codex CLI integration for AIassisted DevOps

We are exploring how we can help on-call engineers troubleshoot incidents more effectively by providing the OpenAI Codex agent with access to real-time observability data in terminals. We've developed an integration and new tool visualizations that connect OpenAI's Codex CLI to the new Datadog MCP server. In this post, we'll share what we've been experimenting with: enabling an AI agent to retrieve production metrics, logs, and incidents from Datadog in real time and act on that context.

Ops Explained: AIOps vs. DevOps vs. MLOps vs. Agentic AIOps

There’s a common misconception in IT operations that mastering DevOps, AIOps, or MLOps means you’re “fully modern.” But these aren’t checkpoints on a single journey to automation. DevOps, MLOps, and AIOps solve different problems for different teams—and they operate on different layers of the technology stack. They’re not stages of maturity. They’re parallel areas that sometimes interact, but serve separate needs.

DevEx Unpacked 003 - Scaling Cloudsmith, Security Innovation & Developer DNA with Tom Gibson

Episode 003: In this episode of DevEx Unpacked, Alan Carson sits down with Tom Gibson, Principal Engineer and long-time Cloudsmith team member, to trace his journey from early start-up to leading strategic innovation in the CTO’s office. Tom shares behind-the-scenes stories about engineering through scale, building continuous security scanning, and what it takes to evolve a developer-first platform.

Is AI the Future of Software Development, or Just a new Abstraction? Insights from Kelsey Hightower

Join Kelsey Hightower as he shares his thoughts on the current state of AI and its potential impact on software development. In this discussion with Mark Boost and Dinesh Majrekar, Kelsey explores the possibilities and limitations of AI, and how it may change the way we build and interact with software. From the importance of pragmatism to the role of abstraction, Kelsey offers valuable insights for developers, engineers, and anyone interested in the future of technology.

Supercharge your iOS and MacOS development: CircleCI offers M4 Pro resources

For developers building on iOS and macOS, building the most performant software means having access to the latest Mac resources to quickly build, test, and deploy software. Apple’s newest M4 Pro chip represents yet another significant leap in Apple Silicon performance, delivering unprecedented speed and efficiency for development teams.

Designing Secure Healthtech Systems for Long-Term Patient Trust

Digital transformation in healthcare has accelerated rapidly, bringing an influx of connected platforms, from electronic health records and patient portals to wearable diagnostics and telemedicine tools. As more patients interact with healthcare systems through digital interfaces, the stakes have risen dramatically. In this high-trust environment, cybersecurity is a core component of patient confidence and operational integrity.
Sponsored Post

7 Best Service Virtualization Tools of 2025

Service virtualization tools have become indispensable for organizations seeking to streamline their testing and development processes. These tools allow teams to simulate the behavior of critical software components, enabling more rapid development with overall cost reduction and improved collaborative outcomes. As demand mounts for service virtualization solutions, identifying the best tools to support this workflow in the software development lifecycle has never been so important.

An Easy Guide to Getting Started with Elastic APM

Code in production will break. Maybe a request takes too long, maybe it fails quietly, or maybe it works fine one minute and falls over the next. Logs can help, sure—but they don’t always show the full picture, especially when performance issues are involved. Elastic APM gives you a clearer view. It traces what your application is doing from incoming requests to database queries and everything in between.

A Simple Guide To GKE Cost Allocation And Cluster Spend

Running workloads on Google Kubernetes Engine (GKE) delivers impressive scalability and flexibility. Yet, it can also introduce a tricky challenge: tracking GKE costs accurately. Remember, GKE costs rarely scale linearly. Overprovisioned nodes, idle autoscalers, and orphaned workloads can quietly balloon your bill in the background. And while GKE’s native tools offer some visibility, they often miss the full picture.

Achieving Sovereign AI with the JFrog Platform and NVIDIA Enterprise AI Factory

Sovereign AI ensures control over AI/ML data, models, and infrastructure, which is now essential for enterprises, regulated industries, and national interests. JFrog and NVIDIA have collaborated to deliver a secure, scalable solution for sovereign AI. NVIDIA provides the accelerated computing and AI software while JFrog ensures trusted DevSecOps and MLOps practices across the entire AI lifecycle, from model development and security scanning to deployment at the edge and in air-gapped environments.

Compare two PostgreSQL databases with Redgate pgCompare

In this 3 minute video, Redgate Advocate Grant Fritchey walks you through Redgate's latest comparison technology, Redgate pgCompare. With Redgate pgCompare, you can quickly compare two PostgreSQL databases and synchronize the changes. Redgate pgCompare is in preview and available to download for free. Take it for a spin and share your feedback.

Introducing Environment Policy- Gain Unified Control Over Compliance Requirements Across Your Runtime Environments

In modern software development, different environments often have different compliance requirements. Your development environment might allow more flexibility, while production demands strict controls around security scans, testing, and code review. Environment Policy helps you codify these requirements and enforce them consistently.

Sustaining the demand for AI in Asia with investment in subsea cable infrastructure

Across the Asia Pacific region significant investment is going into new subsea cable infrastructure that will help sustain the long-term demand for AI. We’ve written a lot on this blog about the impact of AI on networks and how AI workloads require low latency, high-capacity data transfer. This in turn puts more pressure on existing network infrastructure and in particular subsea cable systems - which provide the global backbone for cloud platforms and data centres.

Infrastructure Management: When to Pick Bare Metal or Virtualized Servers

Infrastructure management isn't about taking sides. Too often, teams get pulled into “X is better than Y” debates that miss the bigger picture: your compute stack should serve your needs, not industry hype. A common decision point in the past has been the choice between bare metal or cloud hyperscalar virtualization. Nowadays, the answer isn't 1 or 0.

The Future of WAN Design Depends on Network as a Service (NaaS)

Megaport and AWS explore how Network as a Service (NaaS) transforms WAN design with cloud-native agility, on-demand provisioning, and GenAI-ready flexibility. Co-authored by: Rishi Katdare, Leader – AWS Core Networking & GTM, AWS Mokshith Kumar, Sr. GTM Specialist Solutions Architect – AWS Core Networking, AWS As enterprise architectures grow more distributed and cloud-native, traditional methods of building and managing Wide Area Networks (WANs) are reaching their limits.

Canonical delivers Kubernetes platform and open-source security with NVIDIA Enterprise AI Factory validated design

To ease the path of enterprise AI adoption and accelerate the conversion of AI insights into business value, NVIDIA recently published the NVIDIA Enterprise AI Factory validated design, an ecosystem of solutions that integrates seamlessly with enterprise systems, data sources, and security infrastructure. The NVIDIA templates for hardware and software design are tailored for modern AI projects, including Physical AI & HPC with a focus on agentic AI workloads.

Accelerate Network Automation with No-Code/Low-Code Tools

Discover how no-code/low-code platforms are transforming network automation. In this webinar, industry experts show how “citizen developers” can automate workflows without complex software projects. Live demo of no-code/low-code tools in action Real-world examples across the network operations lifecycle How AI supports smarter, faster automation Learn how to cut costs, streamline operations, and accelerate your automation journey—starting today.

DevEx Unpacked 002 - DevRel, Donuts & Distributed Systems with Dan McKinney

Episode 002: In this episode of DevEx Unpacked, Alan Carson sits down with Dan McKinney, one of Cloudsmith’s earliest team members and now Head of Solutions Engineering. Dan reflects on his unique journey from writing docs and filming DevRel videos to leading high-stakes enterprise sales. Discover how Cloudsmith scaled from a two-person start-up to a platform trusted by global enterprises, why software supply chain security is more urgent than ever, and what features make developers and security teams lean in.

Rancher Live: The Kubernetes report card

Join Divya Mohan live on July 17th at 2 PM UTC on to explore OpenReports—a new project for unified, API-driven reporting. Discover how OpenReports simplifies capturing and consuming policy, security, and compliance reports via a vendor-neutral API. See live demos, real-world use cases, and learn how this project brings clarity and consistency to Kubernetes reporting. Don’t miss it!

Why your Shopware store feels fast until it doesn't

Shopware is a powerful platform, but its performance depends entirely on how it is used. In this article, we explore the most common and avoidable causes of slowdowns in production environments, including plugin overload, cache fragmentation, and misconfigured admin settings. Whether you are preparing for a high-traffic event or simply aiming to keep your storefront fast and responsive, this guide will help you identify the root causes of performance regressions.

No Sandwich, No Security: What This Week's Lunch Taught Me About DNS Blind Spots

Like many shoppers in the UK this week, I found myself staring at half-empty shelves in my local grocery store. In a small but frustrating twist, my usual sandwich, chicken mayo on malted bread, was nowhere to be found. The disruption wasn’t just about lunchtime preferences; it was part of a broader impact from cyberattacks that hit major UK retailers, including Co-op and Marks & Spencer.

Azure Budget Planning: Simplify Cost Management and Forecasting

The video introduces the new Turbo 360 feature designed to simplify Azure budget planning for teams. It highlights how team managers can easily manage and project costs, input monthly records, and adjust budgets based on upcoming projects, all while minimizing reliance on technical resources. The focus is on enhancing productivity and making financial management more accessible.

OWASP CI/CD Part 6: Insufficient Credential Hygiene

This post, part six of our OWASP CI/CD Top 10 series, looks at some of the common risks associated with Insufficient Credential Hygiene. By better understanding the flaws that affect credential hygiene, we can better understand how even the most sophisticated pipelines were compromised.

Apache Spark security: start with a solid foundation

Everyone agrees security matters – yet when it comes to big data analytics with Apache Spark, it’s not just another checkbox. Spark’s open source Java architecture introduces special security concerns that, if neglected, can quietly reveal sensitive information and interrupt vital functions.

The Future of IT Is Human + Agentic: How Zero Ticket IT Is Reshaping Tech Careers

Automation has always stirred up fears of job loss. For IT professionals, the conversation has only grown louder with the rise of AI. But the truth is that the future of IT is not about replacement—it’s about reinvention. For decades, IT has been defined by its firefighting: manually resolving tickets, managing endless alerts, and fielding repetitive service requests. These tasks are ripe for automation, but automation doesn’t eliminate the need for IT talent.

How to Monitor Kafka Producer Metrics

Your Kafka producer pushed a million messages yesterday. Nice. But can you tell if they all made it? Or why did latency spike at 2 PM? Producer metrics help you determine that. They expose how long messages take to send, whether messages are getting stuck, and whether retries are piling up. Let’s go over which ones help while debugging and how to monitor them.

Opsgenie Is Shutting Down: Why FireHydrant Is the Natural Evolution

Opsgenie set a high bar. For years, it helped teams respond faster and stay on top of incidents with reliable alerting and on-call management. At FireHydrant, we’ve always admired how Opsgenie modeled incident data and structured its workflows — it was one of the best in the game. But as Atlassian sunsets Opsgenie and teams face the pressure to migrate, there’s a real decision to make: move into Jira Service Management, or find a new solution that fits your team’s needs and scale.

DevEx Unpacked 001 - Scaling Secure Software with Alison Sickelka

Episode 001: In this inaugural episode of DevEx Unpacked, host Alan Carson sits down with Alison Sickelka, VP of Product at Cloudsmith, for a deep dive into the evolution of software supply chain security. Alison shares her journey from journalism to product leadership, the unique talent landscape in Belfast, and how Cloudsmith is pioneering secure artifact management. Learn how Cloudsmith's Enterprise Policy Management is shaping compliance strategies, why SBOMs are crucial, and where AI fits in a secure DevOps future.

Introducing GitKraken MCP: AI Agents Just Got a Power-Up

With the latest iteration of the GitKraken CLI, you can now connect to a local MCP server to deliver more functionality to your agent of choice. Whether you are using GitHub Copilot, Cursor, Windsurf, or any other tool, you can now leverage the power of GitKraken’s MCP server to enhance your workflows.

#045 - Beyond Cluster Creation: Mastering Multi-Cluster Kubernetes with Gianluca Mardente (Cisco)

Join Itiel as he chats with Gianluca Mardente, a Principal Engineer at Cisco Systems. Gianluca shares his path to tech and Kubernetes, including his work history and the inspiration behind his open-source project, Sveltos. They dive into the significant challenges of managing a large fleet of Kubernetes clusters – ensuring consistency, handling upgrades, and coordinating resources across different clusters.

Multi-Stage Malware Attack on PyPI: Malicious Package Threatens Chimera Sandbox Users

Open-source package repositories like the Python Package Index (PyPI) play a crucial role in software development. However, these platforms are also potential targets for malicious actors attempting to exploit application software vulnerabilities. The JFrog Security Research team regularly monitors open source software repositories using advanced automated tools, in order to detect malicious packages.

Rancher Live: Balancing Open Source Activities in Corporate Environments

Join the discussion about how to balance Open Source Activities in the context of corporate live. Based on Amanda and Kim's talk at KubeCon Europe 2025 in London - Achieving a balance between corporate goals and open source activities is essential for organizations that offer and rely on both commercial and open source technologies. This balance can be hard to achieve when you have goals, needed results, and resource constraints all pulling in different directions.

Accelerate Oracle Cloud Infrastructure monitoring with Datadog OCI QuickStart

Datadog’s Oracle Cloud Infrastructure integration enables you to collect metrics and logs from your entire OCI stack and monitor them within a single platform alongside other third-party technologies. Datadog’s new OCI QuickStart is a fully managed, single-flow setup experience that helps you monitor your OCI infrastructure and applications in just a few clicks.

A Roadmap To AWS Savings Plans Vs. Reserved Instances

A decade after launching Reserved Instances (RIs), Amazon Web Services (AWS) introduced Savings Plans as a more flexible alternative to RIs. AWS Savings Plans are not meant to replace Reserved Instances; they are complementary. SPs and RIs have some significant differences that make each better suited to specific uses. For example, while Savings Plans apply to both EC2 and Fargate instances, RIs only apply to EC2 instances. Let’s break down AWS Savings Plans vs.

Open Source Automation Tools: Popular Options & How to Choose the Right One for Your Needs

When looking at the broad landscape of IT automation tools, you’ll find dozens (if not hundreds) of tools that seem like viable solutions to your automation needs. Almost all of those tools can be broken down into two categories: Open source automation tools and commercial automation tools. (Open source automation tools with a commercial offering are still considered open source, even if the commercial edition has a price tag.)

How to Automate Azure Cost Reports in Minutes?

Reporting Azure costs to stakeholders doesn't have to be a time-consuming task. In this video, Michael Stephenson - a FinOps certified practitioner, introduces a new feature that helps you generate ready-to-use PowerPoint reports—so you can walk into any meeting with clear insights on your team's Azure cost performance. From cost trends and anomalies to right-sizing recommendations, the Executive Summary Report gives you a complete picture in minutes. It's all about making Azure cost optimization easier, faster, and more actionable.

Zero Trust for Compliance: How Kosli Helps Engineers Automate the Paperwork

Engineers didn’t sign up to fill out forms, attend CAB meetings, or screenshot deployments. Yet that’s the reality of compliance in many organizations. In this video, Mike Long (CEO & Co-founder, Kosli) explains how Kosli helps software engineers eliminate the repetitive, meaningless tasks of traditional compliance — and replaces them with something automated, provable, and secure. Video Timeline.

The Full Picture of Software Delivery: How Kosli Connects Every Change to Its Origin

Software engineers don’t need more dashboards or forms. They need a reliable record of what actually happened in their systems—and how it ties back to the code. In this video, Mike Long (CEO & Co-founder, Kosli) explains how Kosli records every event in your SDLC and connects it to every system change. This gives you a full, auditable view of software delivery—from code to production.

How Console Connect enhances AWS Direct Connect for global cloud connectivity

In today’s always-on, cloud-first world, it’s no surprise that enterprises are demanding greater reliability, stronger security, and faster performance from their network infrastructure. AWS meets these needs with AWS Direct Connect, offering a dedicated, private connection to AWS that bypasses the public internet.

How Successful Teams Master Cloud Resource Management

Cloud computing promised speed, scale, and freedom. And it delivered. Engineers can deploy in seconds. Teams can scale globally overnight. But somewhere between all that freedom and speed, control got blurry. Resources piled up. Budgets ballooned. And suddenly, no one could answer the simple question: What are we paying for and why? Cloud resource management is how we reclaim that control, without slowing down.

Cisco Webex Edge Connect Launches on Megaport Voice and Video Exchange

Get superior performance for your business collaboration with Cisco Webex Edge Connect, now available on Megaport Voice and Video Exchange (VVx). In distributed workplaces, the difference between productive collaboration and frustrating delays comes down to one critical factor: connection quality. When meetings matter, whether they’re daily standups or critical client presentations, the underlying network performance directly determines success.

What's new in SQL Prompt Version 11.0

We’re excited to announce the release of SQL Prompt Version 11.0. SQL Prompt Version 11.0 adds support for SQL Server Management Studio 21 (SSMS 21) and includes compatibility with Azure Synapse Dedicated SQL Pools—building on our existing support for Azure Synapse Serverless SQL Pools. Read on to find out more about this new support along with other recent improvements to SQL Prompt.

Making the Case for Creating a Digital Twin of All Your Technical Spaces

Technology assets are no longer confined to the walls of a traditional data center. They now span a range of environments from core facilities and labs to distributed sites like IDF closets, manufacturing sites, and retail branches. Yet many organizations still rely on fragmented tools and manual processes to manage these distributed environments. This can result in gaps in visibility, inconsistent documentation, and higher operational risk.

How to Integrate OpenTelemetry Collector with Prometheus

Pulling observability data together is rarely clean. Metrics come from everywhere, formats vary, and making sense of it takes some work. OpenTelemetry Collector and Prometheus fit perfectly here. The Collector handles ingestion and processing from different sources, while Prometheus stores and queries the data. Simple, effective, and no vendor lock-in. In this blog, we cover how to integrate the Collector with Prometheus, common pitfalls, and ways to control costs.

Steve Owens of Verizon Discusses TDM Switch Decommissioning at Ribbon Insights 2024

The video discusses Verizon’s strategic approach to accelerating the decommissioning (decom) of legacy Time Division Multiplexing (TDM) switches. The speaker emphasizes the importance of TDM switch decom in reducing power consumption, cutting expenses, complying with regional climate regulations, and reclaiming valuable technical space. A key driver for Verizon is the steep costs associated with maintaining aging infrastructure under increasingly stringent local carbon emission laws, particularly in the Northeast Corridor.

A Complete Guide to Linux Log File Locations and Their Usage

Linux log files are text-based records that capture system events, application activities, and user actions. They're stored primarily in the /var/log directory and provide essential information for debugging issues, monitoring system health, and maintaining security. This guide covers the most important Linux log files and a few detailed techniques for reading and analyzing them.

Azure Cost Optimization Best Practices - you can actually implement

Explore actionable strategies for Azure cost optimization—helping you move from just monitoring to actually reducing spend. Discover how leading teams are embedding FinOps best practices, leveraging automation, and building a cost-aware culture to keep cloud bills under control.

Sovereignty, Liquid Cooling, and the New Infrastructure Hierarchy with Gavin Dudley

The AI revolution is here, and data centers are its beating heart! Gone are the days of data centers being just overlooked IT infrastructure; they are now the "cool kids on the block," essential for powering the most significant technological shift of our generation.

Secure Docker Image Pulls from Cloudsmith to Kubernetes using OIDC

Pulling Docker images from private registries for containerised applications presents a security challenge. It requires authentication management, network access, and trust across distributed systems. Credentials must be securely handled and rotated, and image pulls can break due to network restrictions or expired tokens. All of this makes deployment and security harder.

OWASP CI/CD Part 5 - Insufficient PBAC

One of the more overlooked yet critical vulnerabilities highlighted in the OWASP Top 10 for CI/CD Security Risks is Insufficient PBAC (Pipeline-Based Access Controls). Let’s unpack what PBAC is, why it's essential, and how you can leverage modern access control tools like Open Policy Agent (OPA) and Rego to mitigate these risks effectively.

Easy Method for Monitoring MinIO Performance Using Telegraf

MinIO is a high-performance, S3-compatible object storage server built for cloud-native applications. It’s open-source, lightweight, and incredibly fast which makes it a solution for developers who need to store and serve unstructured data like images, logs, or backups. Whether you’re building a self-hosted alternative to Amazon S3 or running MinIO as part of a local development pipeline, it fits into modern containerized environments.

8 Challenges Data Center Managers Must Overcome in 2025

Rising power consumption is now a defining issue. Emerging AI models, GPU-accelerated computing, and dense workloads are driving spiraling demand for electricity. While hyperscale data centers continue to expand, even midsize operators feel the strain of supporting power-hungry applications. Simultaneously, sustainability is moving to the forefront. Regulations like the European Union’s Energy Efficiency Directive (EED), U.S.

9 Best OpenShift Alternatives For Today's DevOps Teams

OpenShift delivers a lot right out of the box. And for many teams running at enterprise scale, it’s exactly what they need. The platform offers a built-in container registry, observability tools, and service mesh support. You also get integrations for GitOps, serverless, and even ML workflows. OpenShift combines powerful orchestration with developer tooling, CI/CD pipelines, and enterprise-grade security. All under one roof.

Flexible, Evidence-Driven Compliance: Meet Kosli's Custom Attestations

At Kosli, we believe that governance in software delivery shouldn’t be a bottleneck – it should be an extension of how your teams already work. That’s why we’re excited to introduce custom attestations in Kosli. Here’s the short version: What are custom attestations? They let you record facts about your workflows – with evidence – using controls that actually match your processes. Why does this matter? Because generic attestations can miss the mark.

Bunnyshell Named Startup of the Year 2024 in Palo Alto by HackerNoon

"If AI is writing the code, we make sure it runs." Alin Dobra, Founder Bunnyshell We’re proud to announce that Bunnyshell has been named Startup of the Year 2024 in Palo Alto by HackerNoon! This recognition reflects the work we’ve done to build the Software Delivery Platform for a new era—where code is written by AI, but validated by real environments.

Hyperparameter tuning for LLMs using CircleCI matrix workflows

Hyperparameter tuning is a critical step in optimizing large language models (LLMs). Parameters such as learning rate, batch size, weight decay, and number of training epochs can significantly affect convergence behavior and final model performance. While several approaches like grid search or random search are widely used, executing them manually is inefficient; especially when each training run is compute-intensive.

Working with GPUs on Kubernetes and making them observable

GPUs are everywhere powering LLM inference, model training, video processing, and more. Kubernetes is often where these workloads run. But using GPUs in Kubernetes isn’t as simple as using CPUs. You need the right setup. You need efficient scheduling. And most importantly you need visibility. This post walks through how to run GPU workloads on Kubernetes, how to virtualize them efficiently, and how Coroot helps you monitor everything with zero instrumentation or config.

AI in Action with Kunal Kushwaha: 2 Demo Showcase. See What's Possible!

Join Kunal Kushwaha, Field CTO at Civo, for two demos using relaxAI. In the first demo, we'll show you how to deploy your own Large Language Model (LLM) inference engine using Ollama, giving you full control over your AI model. In the second demo, we'll demonstrate how to build custom AI integrations using relaxAI API, making it easy to add AI features to your existing applications. Whether you're an AI developer, MLOps team, or just curious about AI, this video is for you.

Open Container Initiative (OCI) Support in Cloudsmith

Kubernetes has become the de facto platform for orchestrating containers. Open standards complement Kubernetes by defining best practices for its implementation. These standards are developed by the open-source Kubernetes community (not a single vendor), ensuring vendor neutrality, easier integration with other tools, and overall system efficiency.

Multiple Malicious Packages Discovered on PyPI, npm, and RubyGems

Evidence of broad and sustained attacks using several npm, Python, and Ruby packages continues to emerge. A series of malicious packages have been added to the npm, PyPI, and RubyGems package repositories. The attacks have been ongoing for some time, with some seeded years ago. Their aims are manifold, including stealing funds from crypto wallets, deleting codebases, and obtaining Telegram messaging data.

What if your container images were security-maintained at the source?

Software supply chain security has become a top concern for developers, DevOps engineers, and IT leaders. High-profile breaches and dependency compromises have shown that open source components can introduce risk if not properly vetted and maintained. Although containerization has become commonplace in contemporary development and deployment, it can have drawbacks in terms of reproducibility and security.

Top 5 Observability Tools DevOps Teams Should Know

Observability and monitoring are the cornerstone of resilient, high-performing applications. Nearly every IT or software engineering leader we come into contact with emphasizes the importance of the ability to understand and diagnose what is going on with their applications at all times. Having clear and concise visibility into your applications is no longer optional.

How to Configure and Optimize Prometheus Data Retention

Prometheus can be lightweight to start with, but once it’s in production, storage usage tends to grow faster than expected. Managing how long data is kept becomes critical, especially when you're working with limited disk space or tight budgets. This guide outlines the key concepts behind Prometheus data retention, how to configure it effectively, and what to watch out for.

What's New in Turbo360 - Azure Budget planner, Executive cost summary report, Snooze monitoring....

Turbo360 brings a suite of enhancements added to elevate your Azure management experience. Hit play to hear what's in store for this month. Introduction (00:00:00) Executive cost summary report (00:00:15) Budget planner (00:01:14) Pause monitoring feature (00:01:51) Exporting and Importing business transactions (00:02:23) Direct Access to Azure Resources (00:02:55) Conclusion (00:03:17)

Watch Automation in Action - Live! (June 2025)

Tired of endless alerts, escalating ticket volumes, and constant firefighting? It’s time to take back control. Join our next live demo to see how the Resolve platform orchestrates workflows that actually work! Reduce noise, slash resolution times, and free up your IT and network teams to focus on what really matters. We’ll show you how Resolve can: Get an exclusive, behind-the-scenes look at how automation and orchestration can help you scale your operations efficiently.

Reliable Dedicated Servers as the Foundation of Scalable DevOps Architecture

Imagine launching a product update at peak traffic time. Your development team pushes the changes, expecting everything to run smoothly. But instead of seamless deployment, the infrastructure buckles-delays spike, user complaints pour in, and error logs flood your screen. Sound familiar? In the world of DevOps, where agility and uptime are non-negotiable, the strength of your backend setup determines how fast-and how safely-you can move. At the heart of this digital engine lies a crucial but often underestimated component: the server. More specifically-reliable dedicated servers.

Community Vigilance, Enterprise Response: Addressing CVE-2024-21626 in Rancher

In backend engineering, many days follow a familiar rhythm: coffee, code reviews, maybe deploying a new feature. But occasionally, the routine is interrupted by a message that signals a different kind of challenge, like a Slack notification from the security team: “Hey, we’ve identified a potential issue. Need to sync up.” This post details one such instance—our journey addressing CVE-2024-21626, a privilege escalation vulnerability reported in Rancher.

Fleet management with Landscape for Ubuntu Core

Shipping your Ubuntu Core IoT device is just the beginning, managing it at scale is where the real challenge begins. In this session, Michael Croft-White (Engineering Director at Canonical) walks through how Landscape, Canonical’s fleet management tool, helps you keep devices updated, secure, and properly configured throughout their lifecycle.

Docker Hardened Images for tightened security and strong provenance

Docker's VP of Product, Michael Donovan, gives a quick overview of Docker Hardened Images and how they make open source software available in a hardened image container. They're minimal images with less attack surface and SLSA level 3 artifact compliance. They carry extensive provenance data, including SBOMs, CVEs, and VEX. Be confident that your software is safer from attack using Docker Hardened Images and Cloudsmith.

How to Log Into a Docker Container

When your Docker container isn't behaving the way you expect, you need to get inside and see what's going on. Maybe your app is throwing errors, a service won't start, or you just need to check some configuration files. Getting into a running Docker container is simpler than you might think, but there are several ways to do it depending on your situation. This guide shows you exactly how to log into Docker containers, troubleshoot common issues, and debug your applications effectively.

Tracking Down Object Changes in Flyway Migration Files | The Tony and Tonie Show

Ever spent hours digging through a pile of Flyway migration files, trying to figure out when a table, view, or procedure changed, and how? In this episode, Tony and Tonie explore Phil Factor’s time-saving PowerShell solution that scans the files for you in version order and pulls out the relevant DDL changes.

Cloud Workload Management: What It Is And How To Do It

The cloud gave us agility, but it also introduced fragmentation. And in most companies, no one’s fully owning the sprawl. One team deploys a new service in a hurry. Another forgets to shut down a dev environment. Meanwhile, batch jobs run 24/7 on oversized instances. And no one quite knows why your bill is $10K higher this month. The result? A growing source of cost overruns, performance headaches, and operational inefficiencies. This is exactly why cloud workload management is so crucial.

Reimagining the Data Centre: Respect, Resources, and the Path to a Sustainable Digital Future

When most people picture a data centre, they might think of a bland industrial shell—cold, humming, and forgettable. But look closer, and you’ll find something far more profound. A data centre is a heavy, complex, and finely tuned space where an immense volume of information flows, transforms, and creates the digital world we rely on. In many ways, it feels like magic. And yet, so does nature. Both deserve respect.

Deploy Istio at Scale With Rancher

Managing and deploying applications across multiple Kubernetes clusters presents significant challenges, especially as the number of clusters grows. Traditional methods, like manually applying Helm charts or manifests per cluster, become cumbersome, error-prone, and difficult to scale or maintain consistency for Day 2 operations. While Rancher allows managing Helm chart repositories and apps, this is done on a per-cluster basis via the UI.

How CloudZero's OpenAI Integration Provides Unprecedented AI Unit Economic Insights

AI spending continues to accelerate. In 2025, experts project that companies will collectively spend about $644 billion on generative AI alone — a whopping 76.4% increase from 2024. This puts it a mere $79 billion behind the public cloud as a whole, signaling the most seismic interval of new infrastructure investment since the dawn of the public cloud.

Unlocking Real-Time Collaboration: Why Your Network Is the Key to Vibe Working

Lately, there has been a growing buzz around the concept of “Vibe Working,” where teams are leveraging AI to dynamically share, develop, test, and transform “fuzzy” ideas into something useful in real-time. I view this approach as one of the next significant evolutions in our professional and technological landscape. Reflecting on my own journey in technology, I’ve observed how the pace of innovation and collaboration continually reshapes our daily workflows.

Is AI already replacing me? Insights from Civo Navigate

With all the rapid advancements in machine learning and AI, it can feel like we’re constantly playing catch-up. Over the last two Civo Navigate conferences, Berlin 2024 and San Francisco 2025, Civo brought together leading experts to discuss the future of AI, machine learning, and the growing challenges and opportunities for developers and businesses.

Graylog vs ELK: Which Log Management Solution Fits Your Stack?

Your app logs start simple—maybe a few print() or logging.info() calls. But in production, things get noisy. Thousands of log lines per minute, scattered across services, and it’s hard to know what matters. This is when tools like Graylog and the ELK stack help. They let you collect, search, and make sense of logs, but they do it in different ways. This guide breaks down how each one handles setup, scale, and day-to-day use.

How to Monitor and Manage Grafana Memory

It’s late, you get an alert, and Grafana is down. The reason? It ran out of memory. If you’ve ever watched Grafana slowly eat up RAM until it just stops responding, you know how frustrating that can be. Memory can spike quickly, especially with complex dashboards and multiple data sources. This guide will help you understand what’s going on and how to keep Grafana running without surprises.

AI Service Desk Showdown: RITA vs. Legacy Chatbots

Over the last decade, IT teams have leaned on chatbots to manage rising support volumes, aiming to deflect tickets and lighten the burden on overworked service desk agents. These legacy chatbots served a purpose—but today they’ve hit a wall. Static responses and script-based flows can’t keep pace with the expectations of modern digital workers or the dynamic needs of enterprise IT. That’s where a new kind of intelligence emerges.

Unleashing Efficiency: Top Benefits of Data Center Tracking Software

This post breaks down the core benefits of data center tracking software. You’ll learn how it improves asset management, enhances physical and digital security, drives cost savings, and increases efficiency in real-world settings. Plus, find out what to consider when selecting the right solution for your facility and what the future holds as data centers lean further into automation and AI.

Michael Donovan, VP of Product at Docker, has a hot take on shift left security

Shift left means improving security at the early stages of software development. Is it the best approach? See the full webinar: https:/cloudsmith.com/webinars Get to know Cloudsmith: About Cloudsmith We offer the world's best cloud-native artifact management platform to control, secure, and distribute everything that flows through your software supply chain. Cloudsmith operates at enterprise scale, reduces risk, and streamlines builds.

New Features in SQL Server 2025: AI, Performance, and Cloud Integration

Most databases were built for a slower world, one where data waited, systems pulled, and answers came later. But today, data flows. It triggers real-time decisions, powers models in motion, and spans every cloud layer. This shift demands a different kind of database—one built to execute, not just store. Microsoft has answered that call with SQL Server 2025.

Prometheus Alerting Examples for Developers

Everything looks fine—dashboards are green, logs are quiet. But users start reporting slow response times. No errors, no traffic spikes. Just a general slowdown. It’s a common situation. Not all problems show up as crashes or clear failures. Sometimes, performance degrades quietly, and standard metrics don’t catch it early. But that's where Prometheus alerting can help, if you're monitoring the right signals.

Jaeger vs Zipkin: Which is Right for Your Distributed Tracing

When requests slow down across your microservices, tracing helps you understand where time is spent. Jaeger and Zipkin are two popular tools for distributed tracing, built to answer a simple question: where did the request go? If you're choosing between them or just exploring options, this guide breaks down the differences and when each one might be a better fit.

Introducing CloudZero Optimize: Built For Engineers, Backed By Context, Designed For Real Results

We’re entering a new era at CloudZero, and we’re coming in hot. This week, we’re not just launching a new feature. We’re taking a major step forward in our mission to make cloud cost efficiency a seamless, engineer-first, business-aligned reality. My team and I are thrilled to introduce CloudZero Optimize, our smartest, most action-oriented optimization solution yet.

Why database observability is key to successful cloud data platform adoption

Data is the lifeblood of businesses the world over, from the smallest startup to the largest enterprise. Making sure that it’s available when you need it, secured for authorized use, and recoverable from faults is vital to operating data platforms, no matter where your business is on its cloud journey. This can only be achieved by putting the right data into the hands of the right people, in a timely way, to make the right decisions about how to manage that platform effectively.

System Hardening Explained: Types, Techniques, Examples & Mistakes to Know

The broad umbrella of today's IT security includes standards, tools, technologies, and human practices that reduce risk and protect your systems. System hardening is one conceptual catch-all for those components of IT security — but what does system hardening mean in relation to your actual day-to-day operations? And how do you achieve system hardening without burdening your whole team?