Operations | Monitoring | ITSM | DevOps | Cloud

Sponsored Post

How to Configure OpenTelemetry as an Agent with the Carbon Exporter

If you're already using OpenTelemetry for tracing and logs, adding otelcol-contrib as an agent for system metrics just makes sense. It keeps everything in the same pipeline, so you're not juggling multiple monitoring tools or dealing with inconsistent data formats. Plus, with built-in support for host metrics, custom processing, and direct exports to Graphite, it's a solid way to ship performance data without extra overhead. In this article, we'll detail how to install the OpenTelemetry Collector Contrib distribution, and configure it to export system performance metrics to a Graphite datasource.
Sponsored Post

Fabrix.ai Demo Day Showcases Agentic Platform and AGNTCY Collective Ecosystem Alliance

Fabrix.ai, a pioneer in enterprise-ready agentic AI solutions, successfully hosted its highly anticipated Agentic AI Demo Day yesterday, bringing together IT operations, NOC operations, and AI operations professionals for a comprehensive showcase of its Purpose-built Agentic AI Operational Intelligence Platform.

Unlock Cheaper & Faster AI Testing: Mocking Claude and MCP

Generative AI is quickly becoming ubiquitous in the software development space, with tools like Anthropic’s Claude offering rapid methodologies for code iteration, testing, and deployment. As new solutions, such as MCP (Model Context Protocol), are created to make integration more seamless, enterprises are adopting these AI solutions to optimize their development processes, a familiar challenge repeatedly arises: cost.

Everything You Need to Know About OpenTelemetry Histograms

Modern systems throw off a lot of data—metrics, traces, logs—sometimes more than we know what to do with. When you're trying to understand how values spread out over time (like response times, memory usage, or queue lengths), averages alone don’t tell the full story. OpenTelemetry histograms help fill in those gaps. This guide walks through what they are, why they matter, and how DevOps engineers can use them to improve observability in real systems.

Correlation ID vs Trace ID: Understanding the Key Differences

You’re staring at logs, trying to figure out what caused that odd error in the middle of the night. Or maybe you're following a chain of requests across services, hoping to understand how one user action triggered a series of unexpected behaviors. That’s where distributed tracing and request tracking—specifically, correlation IDs and trace IDs—are invaluable. It’s the kind of detail that can make debugging faster and less painful.

Agentic AI and How It is Transforming Customer Service

Customer service has gone through a fundamental transformation in the last few years. What started as reactive support through call centers has evolved into AI-driven interactions designed for speed, efficiency, and personalization. According to McKinsey, integrating generative AI into customer care functions can drive productivity gains of 30-45%. Agentic AI is taking this further by automating entire workflows.

Pager fatigue: Making the invisible work visible

As much as you try to prevent it, your product will break sometimes. While you hope it would have the decency to do so while you are awake and already working, sometimes the product is inconsiderate and decides to break outside your office hours. Being woken up from a page at 3 am sucks, and being woken up again two hours later (when you get pinged for a follow-up issue you missed the first time) sucks even more.

JFrog's Journey with AWS Graviton

Every business strives to optimize operational costs and efficiency. In the DevOps world, where cloud-scale operations are the norm, this becomes even more critical. At JFrog, while delivering a robust and highly scalable SaaS solution to our customers, we are equally focused on optimizing operational costs and maximizing infrastructure efficiency.