Operations | Monitoring | ITSM | DevOps | Cloud

Elephant Flows: The Hidden Heavyweights of AI Data Center Networks

Elephant flows are no longer rare. They’re foundational to AI workloads. In today’s GPU-heavy data centers, long-lived, high-volume flows can distort ECMP, overflow buffers, and rack up unexpected cloud bills. Kentik helps you see and tame these elephants with real-time flow analytics, automated alerting, and predictive capacity planning.

Can Claude Code Observe Its Own Code?

One of the great things about OpenTelemetry is that it’s a standard, and standards tend to proliferate. I was excited to see Claude Code add OpenTelemetry metric and log support in a recent release. What was really interesting—beyond the ability to capture usage data from Claude Code—is that you can also get pretty detailed logs about what you’re doing with Claude Code.

Making AI scalable with database change management and Redgate Flyway

With the rise of AI and machine learning comes data. Lots of it. For organizations today, AI is radically changing the way data is accessed, maintained and operationalized. For heads of architecture and development teams, it offers opportunity and responsibility.

Perform Distributed Tracing for your MCP system with OpenTelemetry

2025 has truly been the year of Agentic AI, with MCP (Model Context Protocol) emerging as one of its flashy and most talked-about innovations. While many products have seamlessly integrated MCP servers into their systems, these servers are increasingly being labelled as black boxes, opaque components that handle critical tasks but offer little visibility into what’s happening under the hood. We prompt an agent, a tool gets invoked, and a response is generated. But what really happens in between? And when something breaks, how do we trace the failure and debug it effectively?
Sponsored Post

Almaden CEO Leandro Silva Joins Key Discussion on the Digital Future of Business and the Role of AI

On the morning of Thursday, June 12, the São Paulo office of L.O. Baptista Advogados hosted a high-level event titled "Innovation and AI: The Digital Future of Business." The gathering brought together a diverse and engaged audience of legal and tech professionals to discuss how artificial intelligence is reshaping strategic decisions and transforming modern enterprises. Among the featured speakers was Leandro Silva, CEO of Almaden Inc., who joined a dynamic and interactive panel exploring the opportunities and challenges of using AI responsibly in corporate environments.

Going beyond AI chat response: How we're building an agentic system to drive Grafana

As we look at the role AI can play in Grafana going forward, we want to move beyond the simple chat responses that dominate the world of LLMs today and into agentic systems—AI that can understand, reason, and act on your behalf. The ultimate goal is to make it easy to get things done in Grafana using natural language—whether you’re a seasoned SRE or a new developer. And in the AI world, we call this moving from chat completion to task completion.

From Detection to Resolution: How Selector + Itential Deliver AI-Driven Observability and Automated Recovery

Every second counts when it comes to detecting, diagnosing, and resolving network incidents, yet many teams still find themselves stuck in reactive mode, drowning in alerts, manually writing scripts, and managing tickets across disconnected systems. This is where Selector and Itential come in. Together, Selector and Itential deliver a powerful, enterprise-ready solution that closes the loop between detection and action.

How Puppet is Redefining Infrastructure Management with AI, Powered by Perforce Intelligence

AI has emerged as a defining force in modern technology, spearheading transformation across industries. Yet, despite its promise to revolutionize workflows and unlock unprecedented efficiency, most DevOps organizations face significant hurdles in adopting AI safely and effectively. Concerns about complexity, scalability, and governance hold many decision makers back.

Demo Roundups! Meet the PagerDuty AI Agents

Welcome to the future of operations, where people and agents manage critical work together, driving productivity and efficiency. Learn how PagerDuty’s AI agents can supercharge teams, by autonomously handling repetitive tasks and resolving well-known issues, while surfacing data and insights that augment human expertise for faster resolution and higher operational resilience.