Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

AI Agents Are the New Employees: The Identity & Security Crisis Enterprise IT Must Solve

As AI agents become more autonomous, enterprises face a new challenge: How do you secure a workforce that isn't human? In this episode of Agents of IT, Fran Fernandez, Zach Austin, and Ian Coppock explore the growing identity and security challenges surrounding Agentic AI. From permissions and governance to digital identities and access controls, the team breaks down what enterprise leaders need to know before deploying AI agents at scale.

How to Fix Azure Integration Errors in Minutes Instead of Days

Azure integration errors can be difficult to diagnose when messages flow across multiple services such as Logic Apps, Service Bus, Azure Functions, APIs, and external systems. Support teams often spend hours searching through logs and correlating events across services just to identify where a transaction failed.

Why Multi-Agent AI Workflows Need a Control Plane

AI is transforming how infrastructure and platform teams design, deploy, and operate systems. As organizations move from experimentation to production, a clear pattern is emerging. AI can decide what should change, but it cannot safely control how those changes are executed. This creates a gap in modern architectures. That gap is filled by a control plane. That control plane already exists in Puppet Enterprise Advanced.

Why Day 2 Operations Are Harder Than Deployment (And What To Do About It)

Getting your application deployed feels like finishing a race. You push the code, the containers spin up, the health checks go green, and for a brief moment everything feels solved. Then Day 2 arrives. Day 2 is not a specific date. It is the entire operational life of your application after that first successful deployment. It is the stretch of time that can last years, and it is where most teams quietly discover that deployment was the easy part.

How one PM scaled customer discovery with AI

Customer interviews are one of the most powerful ways to build better products — but they’re also time-consuming. In this video, Avinoam “Avi” Zelenko, Principal Product Manager at Atlassian, shares how he transformed the way he runs customer interviews using AI automation and Rovo agents. What used to take hours of coordination, note-taking, and manual summaries now happens automatically. By stitching together the Teamwork Collection and Slack, Avi built a workflow that captures conversations, summarizes insights, and shares them across teams in real time.

The sovereignty debate explained with Nine23

Who really owns your data? Data sovereignty has become one of the defining issues shaping digital infrastructure, cloud strategy and AI adoption. But what does it actually mean, and why has it become a board-level discussion for so many organisations? In Episode 4 of Perspectives from the Edge, Pulsant's Wendy Shearer is joined by Steve Jewell, CEO of Nine23, to explore data sovereignty and its relationship to security, resilience and digital transformation.

Reduce Alert Fatigue with Composite Alerting in Hosted Graphite | Tutorial

Tired of noisy alerts waking you up for issues that are not actually impacting your services? In this tutorial, we walk through MetricFire's Composite Alerting capabilities and show how to combine multiple metric conditions into a single high-confidence alert using AND / OR logic. Learn how to: Reduce alert fatigue and false positives Create service level alerts in Graphite Combine CPU, latency, and database metrics into meaningful alerts Use conditional logic to improve signal quality Build smarter observability workflows with Hosted Graphite.

We wrote the docs

Most security vendors hide their documentation behind a login. Some don’t write it at all. You get a sales page, a demo, and a request to install an agent on your servers, and you’re expected to trust that the thing does what the marketing says. That’s backwards. So we wrote the docs, and we put all of them at certkit.io/docs. No login, no account gate, no “contact us for details.” You can read every page before you create an account.

Real-Time CPU and Memory Insights for Harness CI Cloud Builds | Harness Blog

When a CI pipeline runs on cloud infrastructure, the build machine is ephemeral. It spins up, executes your build, and disappears. During that window, you have zero visibility into how much CPU and memory your pipeline actually consumes. This blind spot creates real problems. Teams over-provision VMs "just in case," wasting compute spend. Others under-provision and deal with silent OOM-kills or CPU throttling — the only clue being a cryptic exit code 137.

What Is Coherent Routing?

Coherent routing, Routed Optical Networking, Converged Optical Routing Architecture (CORA), are all names for the same concept: an advanced network architecture which integrates coherent optical transceivers directly into IP routers. This convergence of layers creates a simplified and highly efficient IP-over-DWDM (IP over Dense Wavelength Division Multiplexing) network.