Monthly Archive

This Month in Datadog - July 2025

Jul 31, 2025 By Datadog In Datadog

In July’s episode of This Month in Datadog, we’re doing things differently by spotlighting the people behind the products you rely on. Jeremy is joined by Tristan Ratchford to discuss saving time and effort when you’re on call with Bits AI SRE, and by Kevin Hu to explore gaining visibility into datasets across the entire data lifecycle with Data Observability.

Read Post

Out-of-the-box Alerting for Frontend Observability in Grafana Cloud

Jul 31, 2025 By Grafana In Grafana

Get alerted on frontend issues the moment they happen, with no setup headaches required. In this short demo, Elliot Kirk from Grafana Labs introduces out-of-the-box alerting for frontend observability. Whether you're tracking error counts or web vitals, this new feature makes it easy to stay ahead of performance issues. With just a few clicks, you can enable prebuilt alerts for your apps, visualize and edit alerts directly in the UI, customize thresholds and durations, set up notifications and stay in the loop, and launch alerting with every new app setup.

View Video

Bring high-performance observability to secure Kubernetes environments with Datadog's new CSI driver

Jul 30, 2025 By Adel Haj Hassan In Datadog

In Kubernetes environments, applications often communicate with the Datadog Agent to send telemetry data such as custom metrics via DogStatsD or traces through Datadog APM. How this communication takes place depends on the communication mode set on the Datadog Cluster Agent's Admission Controller. With the sockets option, communication takes place through local inter-process communication via Unix domain sockets (UDS), whereas the service and default hostip options rely on network communication.
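The difference between the socket and network modes can be sketched in a few lines. This is a minimal illustration rather than Datadog's client library: the socket path and UDP address below are assumed defaults, and `format_datagram` and `send_metric` are hypothetical helper names. The payload follows the StatsD-style `name:value|type|#tags` datagram shape.

```python
import socket

# Assumed defaults for illustration only; check your own Agent configuration.
UDS_PATH = "/var/run/datadog/dsd.socket"
UDP_ADDR = ("127.0.0.1", 8125)


def format_datagram(name, value, metric_type="c", tags=None):
    """Build a StatsD-style datagram: name:value|type|#tag1,tag2."""
    datagram = f"{name}:{value}|{metric_type}"
    if tags:
        datagram += "|#" + ",".join(tags)
    return datagram.encode("utf-8")


def send_metric(payload, use_uds):
    """Send one datagram over a Unix domain socket or over UDP."""
    if use_uds:
        # Local inter-process communication: no network stack involved.
        sock = socket.socket(socket.AF_UNIX, socket.SOCK_DGRAM)
        target = UDS_PATH
    else:
        # Network communication: the datagram goes to a host and port.
        sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
        target = UDP_ADDR
    try:
        sock.sendto(payload, target)
        return True
    except OSError:
        # No socket file or unreachable address: report failure, don't crash.
        return False
    finally:
        sock.close()


payload = format_datagram("checkout.requests", 1, tags=["env:dev"])
```

Calling `send_metric(payload, use_uds=True)` would try the socket file, while `use_uds=False` would send the same bytes over UDP. Only the transport changes, not the payload.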

Read Post

Integrating CI/CD Pipelines with Observability Tools

Jul 30, 2025 By Alexandr Bandurchin In Uptrace

CI/CD pipelines are automated workflows that take code from development to production. The term CI/CD encompasses two key practices: continuous integration and continuous delivery (or deployment). A typical CI/CD pipeline includes stages like code compilation, testing, security scanning, artifact creation, and deployment across multiple environments.

Read Post

Why Observability Isn't Just for SREs (and How Devs Can Get Started)

Jul 30, 2025 By Elizabeth Mathew In SigNoz

Almost every other day, when I scroll past r/devops or r/sre, I see a post asking how a dev can get started with devops, observability, etc. This blog is an attempt to help anyone who is lost find their way into observability, and a wake-up call for devs that they should think about observability more actively today than ever before.

Read Post

This Month in Datadog: Bits AI SRE, Datadog Data Observability, and more

Jul 30, 2025 By Datadog In Datadog

Datadog is constantly elevating the approach to cloud monitoring and security. This Month in Datadog updates you on our newest product features, announcements, resources, and events. To learn more about Datadog and start a free 14-day trial, visit Cloud Monitoring as a Service | Datadog. This month, we chat with two guests about Bits AI SRE and Datadog Data Observability.

View Video

Disposable Code Is Here to Stay, but Durable Code Is What Runs the World

Jul 29, 2025 By Charity Majors In Honeycomb

Every day I seem to run into yet another post with someone solemnly opining that “writing code has never been the hardest part of software engineering.” And hey, that’s smashing. As an engineer from the ops/infra/SRE side of the house, I feel like I’ve been saying this my whole career. (Is there anything more satisfying than being proven right in public? Not in my book.) So, which is it?

Read Post

Data Observability: Build confidence in the data life cycle

Jul 29, 2025 By Datadog In Datadog

Datadog Data Observability provides a complete solution with quality checks (e.g., volume, row changes, freshness), custom SQL-based monitors, anomaly detection, column-level lineage across systems like Snowflake and Tableau, full pipeline visibility, and targeted alerts when data issues arise.

View Video

How to monitor and manage front-end observability in Blackfire

Jul 29, 2025 By Upsun In Upsun

In this video, we'll guide you through the process of monitoring and managing your usage of front-end observability features in Blackfire. Learn how to access your Browser usage dashboard to view browser traces collected per environment, track your quota consumption, and understand the concept of spike protection. You'll discover how Blackfire's automatic detection of abnormal traffic spikes protects your monthly quota and ensures continuous data collection.

View Video

How to Enable and Configure Front-end Observability in Blackfire

Jul 29, 2025 By Upsun In Upsun

In this video, learn how to enable and configure Front-end Observability in Blackfire. The tutorial covers steps to enable features across multiple environments via the Organization settings / Front-end usage in the Blackfire dashboard. Control front-end observability by enabling or disabling Browser Monitoring and Analytics per environment, using a JavaScript probe and a unique browser key. The video emphasizes the importance of naming transactions and explains how to manually add tracking snippets to HTML for better control.

View Video

What is Grafana Cloud? Fully Managed Observability Built on Open Standards | Grafana Labs

Jul 29, 2025 By Grafana In Grafana

Grafana Cloud helps teams detect, investigate, and resolve incidents faster—thanks to AI, open standards, and seamless integrations with OpenTelemetry, Prometheus, Salesforce, and more. See how it all works in this live demo of a simulated e-commerce outage.

View Video

Unifying Observability: Intelligence, Automation, and Insights in Action

Jul 28, 2025 By ScienceLogic In ScienceLogic

As enterprise IT environments evolve into ever-greater complexity and scale, demands on operations teams are accelerating. In the traditional model, observability tools collect data, engineers manually correlate events, and remediation follows a ticketing trail. However, that approach no longer matches the speed and scale of today’s digital businesses. Even the most storied dashboards can’t address today’s operational needs.

Read Post

How I Use GenAI as a Thought Partner, Not a Shortcut

Jul 28, 2025 By Katie Leonard In Honeycomb

You don’t need to be a power user to get powerful results. I’m not training models or prompting GPTs into poetry—I’m just using them to do what great managers already try to do: communicate clearly, prioritize outcomes, and lead with intention. Over the last few quarters, I’ve built a handful of custom GPTs to support my weekly, monthly, and quarterly workflows.

Read Post

Why continuous profiling is the fourth pillar of observability

Jul 25, 2025 By Marcus Hirt In Datadog

Developers have long used profilers to diagnose performance bottlenecks and improve the efficiency of their code. But a modern version of profiling, continuous profiling, is quietly redefining what profiling is and what it can do. By running nonstop in production with very low overhead, continuous profilers give teams always-on visibility into how their code behaves in the real world.

Read Post

Observability Data: Ingestion Pipeline Best Practices

Jul 25, 2025 By Robert Gauthier In Broadcom

Great data is a prerequisite to all things AIOps and observability. Great observability data results in fewer observability gaps, better analysis and insights, and more confidence within teams that rely on the power of modern AIOps and observability technologies. Goals for improved automation, IT efficiencies, intelligent triage and remediation all become more achievable with better data.

Read Post

Tutorial: Visualize Your Puppet Data in Grafana with the Observability Data Connector

Jul 24, 2025 By Puppet In Puppet

When you manage complex IT infrastructure, it becomes critical to use tooling to understand what’s happening across all of your systems in terms of performance, reliability, and compliance. Monitoring key indicators manually is simply no longer possible at that scale. Puppet has long been known as a solution for managing large environments and collecting a vast amount of data about your infrastructure, but accessing and visualizing that data in a meaningful way can be a challenge.

Read Post

AWS Summit NYC 2025: Laser-Focused on AI

Jul 24, 2025 By Ken Rimple In Honeycomb

If you’re unfamiliar with AWS Summits, these are conferences that occur on a yearly basis in different cities. The events are mostly used to announce new products and technologies. This year, the theme was AI, as evidenced by the keynote, a large majority of the talks, and a walk around the vendor floor. The keynote talk was hosted by Swami Sivasubramanian, VP of Agentic AI at AWS.

Read Post

How SAP achieved world-class uptime through modern observability

Jul 23, 2025 By Gerardo Dada In Catchpoint

SAP Customer Experience (CX) has undergone a remarkable transformation over recent years, evolving from fragmented monitoring to a scalable, automated observability powerhouse. In a recent fireside chat, Martin Norato Auer, SAP CX’s VP of Observability, shed light on the strategies, practices, and measurable impacts behind SAP’s SLA, uptime, and responsiveness achievements.

Read Post

Anatomy of AI-powered Root Cause Analysis

Jul 23, 2025 By Nikolay Sivko In Coroot

AI is being used to automate just about everything these days, from writing code to making coffee. Observability is no exception. But before we dive into how AI can actually help, it is worth stepping back to look at what already works, what does not, and where the real gaps are.

Read Post

Architecting for Value: A Playbook for Sustainable Observability

Jul 23, 2025 By Mezmo In Mezmo

You’ve built something amazing. Your services are scaling, your users are happy, and your team is shipping code like never before. Then the cloud bill arrives, and one line item makes your eyes water: observability. That Datadog invoice feels less like a utility bill and more like a ransom note. It’s a modern engineering paradox. The tools that give you sight into your complex systems are the same ones that can blind you with runaway costs.

Read Post

The one where we talk about what's next for Cribl U!

Jul 22, 2025 By Cribl In Cribl

What's next for Cribl University? Tune in to find out.

View Video

How to improve observability with fast log analysis (using FOSS!)

Jul 22, 2025 By Coroot In Coroot

Log analysis can take only seconds (not hours) with time-mapped heat graphs, pattern clustering and analysis, and errors sorted by severity.

View Video

Ship Confluent Cloud Observability in Minutes

Jul 22, 2025 By Anjali Udasi In Last9

You're running Kafka on Confluent Cloud. You care about lag, throughput, retries, and replication. But where do you see those metrics? Confluent gives you metrics, sure, but not all in one place. Some live behind a metrics API, others behind Connect clusters or Schema Registries. You either wire them manually or give up. What if you could stream those metrics to a platform built for high-frequency, high-cardinality time series, and do it in minutes?

Read Post

How to Cut Observability Costs with Synthetic Monitoring and Responsive Pipelines

Jul 22, 2025 By Mezmo In Mezmo

Platform teams are struggling with observability noise, bloated storage costs, and lack of clarity during incidents. Most teams capture everything all the time, leading to expensive, overwhelming, and often unnecessary data volumes. In Telemetry for Modern Apps, Mezmo teamed up with Checkly to demonstrate how synthetic monitoring triggers and responsive telemetry pipelines can help reduce costs while maintaining the context needed during incidents.

Read Post

Streamlining multi-cloud complexity with unified observability

Jul 21, 2025 By ManageEngine In ManageEngine

A wave of businesses are embracing multi-cloud strategies to gain flexibility and scalability. By combining on-premises infrastructure, private clouds, and public platforms like AWS, Azure, and Google Cloud Platform (GCP), IT teams can experiment, deploy, transform, and improve their IT applications significantly. On the downside, this modern IT approach of employing multiple clouds (in both public and private forms) also brings significant complexity, making it challenging to monitor systems, control costs, and secure environments. There are just too many threads to track and tie together to ensure a taut IT fabric.

Read Post

Will AI Speed Development in Your Legacy App?

Jul 21, 2025 By Jessica Kerr In Honeycomb

Some people can get an AI assistant to write a day’s worth of useful code in ten minutes. Others among us can only watch it crank out hundreds of lines of crap that never works. What’s the difference? There are some skills specific to AI development. There are also properties of the codebase we’re working in that make it amenable to AI assistance. Most AI demos use projects created from scratch with AI in mind—cute.

Read Post

I built an MCP Server for Observability. This is my Unhyped Take

Jul 18, 2025 By Elizabeth Mathew In SigNoz

Recently, I read a blog titled “It’s The End Of Observability As We Know It (And I Feel Fine)”, which discussed MCP servers in observability and how these systems would potentially be the “end of observability”. As someone who has spun up an MCP server for an observability backend and as someone who has been in the space for a while, I certainly do not think so.

Read Post

Cloud or Self-Hosted - Which Deployment Model is Right For You?

Jul 18, 2025 By Anushka Karmakar In SigNoz

Choosing the right observability platform is a critical decision. But how you deploy it is just as important. The right deployment strategy can accelerate your team, simplify operations, and ensure you meet compliance and security requirements. The wrong one can lead to operational headaches and slow you down. At SigNoz, we believe in flexibility. There is no single "best" way to deploy an observability platform; there's only the way that's best for you.

Read Post

Honeycomb Named a Visionary in the 2025 Gartner Magic Quadrant for Observability Platforms

Jul 17, 2025 By Julie Neumann In Honeycomb

In the era of AI, software development is at an inflection point, and observability has never been more critical. Teams are dealing with more code, more data, and more pressure than ever before. To navigate these new challenges, you need a partner with a strong vision for the future and a knack for looking around corners. Honeycomb is proud to be named a Visionary in the 2025 Gartner Magic Quadrant for Observability Platforms.

Read Post

Honeycomb In Your IDE? Yes, With Hosted MCP Now Available in AWS Marketplace AI Agents and Tools Category

Jul 16, 2025 By Austin Parker In Honeycomb

I’m pleased to announce the public beta of Honeycomb Hosted MCP, along with our first wave of one-click integrations for Cursor, Visual Studio Code, and Claude Desktop. We’re also very excited to announce that Hosted MCP is available on AWS AI Agents marketplace and for all Honeycomb plans (including our free plan!) at no charge. Honeycomb was built with a singular focus: how do we help teams become better at the art and craft of software development, delivery, and operations?

Read Post

ITRS named in Gartner Magic Quadrant for Observability Platforms

Jul 16, 2025 By Uptrends In Uptrends

When Uptrends became part of ITRS, we knew we were joining a team deeply committed to innovation, precision, and people — whether those people were troubleshooting transaction journeys from their laptops at 8am or keeping enterprise-scale operations online 24x7. We’ve come far since then.

Read Post

ScienceLogic Named a Visionary in the 2025 Gartner Magic Quadrant for Observability Platforms

Jul 15, 2025 By ScienceLogic In ScienceLogic

It’s official: ScienceLogic has entered the observability arena. Named a Visionary in the 2025 Gartner Magic Quadrant for Observability Platforms, we believe we’re helping define where observability is heading, not just where it’s been. This marks our first inclusion in this Magic Quadrant and, in our opinion, validates our mission to redefine intelligent, actionable observability in the era of AI and automation.

Read Post

Kubernetes Monitoring backend 2.2: better cluster observability through new alert and recording rules

Jul 15, 2025 By Serena Kei In Grafana

We’re excited to announce that version 2.2.0 of the backend for our Kubernetes Monitoring solution in Grafana Cloud is now available. The app’s backend is supported by kubernetes-mixin, an open source Prometheus Monitoring Mixin, and this latest version features significant improvements to alert rules and recording rules that will enhance your cluster observability and monitoring experience. There’s a lot to tell you about, so let’s dive in.

Read Post

Monitor agents built on Amazon Bedrock with Datadog LLM Observability

Jul 15, 2025 By Barry Eom In Datadog

As large language models (LLMs) grow more powerful, organizations are deploying agentic AI applications to tackle complex, multi-step tasks. With Amazon Bedrock Agents, developers can orchestrate these agents to manage tasks such as triggering serverless functions, calling APIs, accessing knowledge bases, and maintaining contextual conversations—all while breaking down complex user requests or tasks into manageable steps.

Read Post

How to Troubleshoot Outages Faster Using Elastic Observability [2 Min Live Demo]

Jul 15, 2025 By Elastic In Elastic

In this video, I’ll show you how Elastic Observability helps you reduce downtime, accelerate root cause analysis, and unify logs, metrics, and traces in one powerful dashboard. With native OpenTelemetry support, AI-powered troubleshooting, and built-in anomaly detection, you can streamline your workflows and boost service reliability.

View Video

Arie's Adventures with Coroot

Jul 15, 2025 By Arie Van Den Heuvel In Coroot

Arie van den Heuvel is an engineer, a System and Application Management Specialist, and a valued member of our community. Below he has shared his journey using Coroot, and how it has helped improve observability for his team. You can read more of Arie’s writing and support the resource articles he has created for open source on his blog.

Read Post

Splunk Named a Leader in the 2025 Gartner Magic Quadrant for Observability Platforms

Jul 15, 2025 By Dayna Lord In Splunk

We are proud to announce that Splunk has been named a Leader in the 2025 Gartner Magic Quadrant for Observability Platforms for the third year in a row. In our opinion, our recognition in the Observability category comes on the heels of Splunk being recognized for a tenth consecutive time as a Leader in the 2024 Gartner Magic Quadrant for Security Information and Event Management (SIEM). Splunk was the only vendor named a Leader in both SIEM and Observability for the Gartner Magic Quadrant three times.

Read Post

Climbing the Security Pyramid: From Awareness to Automation with AI and Observability

Jul 15, 2025 By OpsMatters In OpsMatters

Modern threats don't wait. They move fast, hide deep, and often strike without warning. That's why old-school security isn't enough anymore. You need more than firewalls and login rules. You need layers. You need clarity. And most of all, you need speed. This is where the security pyramid comes in. It shows how smart security stacks, from the ground up. It starts with awareness and ends with advanced tools like automation and AI. In this article, we'll break it down step by step and show how observability and automation help you climb it.

Read Post

Observability as Code: Why You Should Use OaC

Jul 14, 2025 By Caitlin Halla In Splunk

In the fast-moving world of CI/CD pipelines, microservice architectures, and container orchestration, software changes rapidly. What exists in a codebase today might be gone next week. At this scale and speed, it’s impossible for development teams to manually track every line of code and every new piece of functionality.

Read Post

Uptrace v2.0: The Future of Observability is Here

Jul 14, 2025 By Vladimir Mihailenco In Uptrace

The Uptrace team is thrilled to announce the release of v2.0—our biggest update yet! This release represents a complete reimagining of how observability data should be stored, queried, and managed. With multi-project support, revolutionary JSON-based storage, powerful data transformations, and a host of developer-friendly features, Uptrace v2.0 is designed to scale with your growing infrastructure needs.

Read Post

The Fast Path to More Useful Telemetry

Jul 14, 2025 By Bernardo Guerreiro In Honeycomb

Over and over, we’ve seen that teams who invest in adding rich, relevant context to their telemetry end up debugging faster and collaborating more effectively during incidents. Getting meaningful context added can feel like a big cross-team project, but some of the highest-leverage improvements don’t require app code changes or coordination across services.

Read Post

What Is Hybrid Observability? A Healthcare IT Explainer

Jul 10, 2025 By LogicMonitor In LogicMonitor

Healthcare IT environments have become incredibly complex. Think about everything running simultaneously in your organization: physical medical devices, cloud platforms, clinical applications like Epic, and patient-facing applications. Each component needs to work together seamlessly, much like how ICU monitors track multiple vital signs at once. Many healthcare organizations still use monitoring solutions designed for simpler times, when systems were more isolated.

Read Post

Grafana Labs named a Leader again in the 2025 Gartner Magic Quadrant for Observability Platforms

Jul 10, 2025 By Jen Villa In Grafana

We’re thrilled to share that Grafana Labs has been recognized as a Leader in the 2025 Gartner Magic Quadrant for Observability Platforms—for the second year in a row. This year’s report placed Grafana Labs furthest in “Completeness of Vision,” which we believe reflects our deep commitment to building a truly open, composable observability stack that gives users flexibility, control, and the tools to own their observability strategy.

Read Post

Elastic named a Leader in the 2025 Gartner Magic Quadrant for Observability Platforms

Jul 10, 2025 By Natalie Blake In Elastic

Observability has an investigation problem, and dashboards and alerts aren’t enough for solving problems in today’s complex systems. AI-driven capabilities, powerful analytics, and the ability to scale are essential to drive real-time investigations while keeping costs low. We think this is why Elastic has been named a Leader in the 2025 Gartner Magic Quadrant for Observability Platforms for the second time.

Read Post

How to improve your observability

Jul 10, 2025 By Coroot In Coroot

Coroot was designed to solve the problem of time-consuming root cause analysis. It handles the full observability journey - from collecting telemetry automatically with zero code setup (thanks, eBPF!) to simplifying the role of SREs and DevOps everywhere with instant root cause analysis powered by AI. We also strongly believe that simple observability should be an innovation everyone can afford to benefit from: which is why our software is open source!

View Video

What is a Data Lake, Data Warehouse, and a Data Lakehouse? (Learn the difference)

Jul 10, 2025 By Coroot In Coroot

Altinity, Inc. Developer Advocate Josh Lee walks us from an '80s IBM to a present day where columnar formats and querying tools like Iceberg are often used to manage data.

View Video

Datadog named Leader in 2025 Gartner Magic Quadrant for Observability Platforms

Jul 10, 2025 By Yanbing Li In Datadog

We are thrilled to announce that, for the fifth consecutive year, Datadog has been named a Leader in the 2025 Gartner Magic Quadrant for Observability Platforms. We believe that this recognition reflects our continued focus on helping customers observe, secure, and act on everything that matters across their technology stack.

Read Post

What Are Traces? A Developer's Guide to Distributed Tracing

Jul 10, 2025 By Rox Williams In Honeycomb

One of the most common challenges in modern software engineering today is understanding how requests flow through applications. As system architectures shift to favor widely distributed, cloud-native designs, keeping track of how an application processes user actions is more difficult than ever. A single user action may trigger events processed in dozens of backend services. Traces are helping software developers today with this challenge.
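To make the idea concrete, here is a minimal sketch of how a trace ties one user action to work done in several services. It is not any vendor's SDK: the `Span` class, the `start_span` and `render` helpers, and the service names are all hypothetical. Each span carries the shared trace ID plus the ID of its parent, which is enough to rebuild the request's path.

```python
from dataclasses import dataclass
from typing import Optional
import uuid


@dataclass
class Span:
    trace_id: str             # shared by every span in one request
    span_id: str              # unique to this unit of work
    parent_id: Optional[str]  # None marks the root span
    service: str
    name: str


def start_span(service, name, parent=None):
    """Create a span; children inherit the parent's trace_id."""
    trace_id = parent.trace_id if parent else uuid.uuid4().hex
    parent_id = parent.span_id if parent else None
    return Span(trace_id, uuid.uuid4().hex[:16], parent_id, service, name)


def render(spans):
    """Return an indented outline of the span tree, one line per span."""
    children = {}
    for span in spans:
        children.setdefault(span.parent_id, []).append(span)
    lines = []

    def walk(parent_id, depth):
        for span in children.get(parent_id, []):
            lines.append("  " * depth + f"{span.service}: {span.name}")
            walk(span.span_id, depth + 1)

    walk(None, 0)
    return lines


# One hypothetical checkout request fanning out to backend services.
root = start_span("frontend", "POST /checkout")
cart = start_span("cart", "load cart", parent=root)
pay = start_span("payments", "charge card", parent=root)
db = start_span("cart-db", "SELECT items", parent=cart)
outline = render([root, cart, pay, db])
print("\n".join(outline))
```

Real tracing systems add timestamps, attributes, and context propagation across process boundaries, but the parent and trace identifiers are the core that lets a backend reassemble the request.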

Read Post

The Inconvenient Truth About AI Ethics in Observability

Jul 10, 2025 By Mezmo In Mezmo

Let's be honest: most conversations about AI ethics sound like they're happening in a boardroom, not an ops room. But here's the thing: when you're using AI to make sense of your telemetry data, ethics isn't some abstract concept. It's the difference between insights you can trust and algorithmic noise that leads you down the wrong path. The uncomfortable reality? Your AI is only as ethical as the messiest, most biased piece of telemetry data you feed it. And if you think your data is clean, well...

Read Post

Grafana Labs is a Leader in the 2025 Gartner Magic Quadrant for Observability Platforms

Jul 10, 2025 By Grafana In Grafana

For the second year in a row, Grafana Labs has been named a Leader in the Gartner Magic Quadrant for Observability Platforms — and this year, we’re proud to be recognized as the furthest in Completeness of Vision. In this video, Grafana Labs CTO Tom Wilkie shares what this recognition means, why our scores for execution and vision both improved, and how it reflects years of building a truly open, composable observability stack.

View Video

Coralogix | Magic Quadrant 2025

Jul 10, 2025 By Ariel Assaraf In Coralogix

Today marks an exciting moment for all of us at Coralogix. We’re proud to share that Gartner has named us a Visionary in the 2025 Magic Quadrant for Observability Platforms. This recognition, we believe, reflects what we’ve been building toward for years: an observability platform that delivers scale, cost-efficiency, AI-powered insights, and tangible customer success.

Read Post

Honeycomb Users Are Living in the Future, Part 1: Sampling

Jul 9, 2025 By Irving Popovetsky In Honeycomb

When we talk to new Honeycomb users, a few things stand out as sounding downright magical. Sometimes we’ll hear, “Wow, is that a new feature?” and we’ll say that no, it’s been like that for years. Clearly we need to get the word out! This is the first installment of a blog series I’ll be writing, covering areas of Honeycomb that elicit reactions of awe and disbelief from new users.

Read Post

Lumigo Launches AI Agent Observability

Jul 9, 2025 By Orr Weinstein In Lumigo

LLM-powered agents are reshaping software, but when they fail, troubleshooting is guesswork. Lumigo’s new AI Agent Observability, now in beta, gives you visibility into the entire lifecycle of your agents, from prompt to response to internal decision logic. Built for modern AI workloads, this feature is designed to help engineers monitor, debug, and optimize agents running on platforms like OpenAI, Anthropic, and open-source models.

Read Post

Lumigo

Read more about Lumigo Launches AI Agent Observability

The one where Ed and Sydnee talk all about AI

Jul 9, 2025 By Cribl In Cribl

Join Ed Bailey, Principal Technical Evangelist, and Sydnee Mayers, Sr. Staff Product Manager, as they chat all about AI!

View Video

Cribl

Read more about The one where Ed and Sydnee talk all about AI

Observability for containerized workloads: How to run Grafana Beyla as a sidecar in Amazon ECS

Jul 9, 2025 By Matt Wimpelberg In Grafana

Note: Grafana Beyla has been donated to OpenTelemetry under the new project name OpenTelemetry eBPF Instrumentation. Beyla will continue to exist as Grafana Labs’ distribution of the upstream project. Grafana Beyla is an open source eBPF-based auto-instrumentation tool that helps you easily get started with application observability, allowing you to monitor and visualize traces without modifying the application code.

Read Post

Grafana

Read more about Observability for containerized workloads: How to run Grafana Beyla as a sidecar in Amazon ECS

Investigating High Partition Load in Honeycomb

Jul 9, 2025 By Honeycomb In Honeycomb

Here, Pierre Tessier shows how he looks into partition load in Honeycomb's distributed datastore in production.

View Video

Honeycomb

Read more about Investigating High Partition Load in Honeycomb

Monitoring & Observability Report Top Findings

Jul 8, 2025 By Fred Koopmans In BigPanda

Today, BigPanda released our first-ever research report based on data gathered from our agentic IT operations platform. Our Monitoring and Observability Tool Effectiveness for IT Event Management report provides insights and benchmarks on incident detection and noise reduction for 130 enterprise organizations, including the monitoring and observability data sources integrated with BigPanda.

Read Post

BigPanda

Read more about Monitoring & Observability Report Top Findings

Observability in under 5 seconds: Reflecting on a year of grafana/otel-lgtm

Jul 8, 2025 By Gregor Zeitlinger In Grafana

With grafana/otel-lgtm, observability is just one Docker command away. Over the past year, grafana/otel-lgtm has simplified observability setups, helping developers get a complete OpenTelemetry stack running in under five seconds. With integrations for metrics, logs, traces, and now profiles via Grafana Pyroscope, it has become a go-to solution for demos, development, and testing, as evidenced by its growing community (1k stars on GitHub and growing!) and notable adopters.

Read Post

Grafana

Read more about Observability in under 5 seconds: Reflecting on a year of grafana/otel-lgtm

How to Simplify AI Observability Across Hybrid and Cloud Environments

Jul 7, 2025 By LogicMonitor In LogicMonitor

As companies adopt more artificial intelligence (AI) to stay competitive and simplify operations, they’re hitting a snag they’ve seen plenty of times before: complexity. Those user-friendly chatbots and impressive predictive models aren’t magic—they run on powerful GPUs like NVIDIA’s and rely on cloud services such as Azure OpenAI or Amazon SageMaker.

Read Post

LogicMonitor

Read more about How to Simplify AI Observability Across Hybrid and Cloud Environments

Why is Open Source Important?

Jul 4, 2025 By Coroot In Coroot

🐧🐝 Try Coroot fully #FOSS and check out the latest open source observability tips on our blog: https://t.ly/qBH9f

#opensource #linux #eBPF #observability #DevOps #Coroot #SREs #kubernetes #softwarelibre #freesoftware

View Video

Coroot

Read more about Why is Open Source Important?

Observability isn't about the tool. It's about the truth

Jul 3, 2025 By Wasil Banday In Catchpoint

An enterprise client reports latency. Your dashboards say everything is fine. They blame you. You blame them. Nobody can prove it either way. This is where most monitoring efforts hit a wall. Too often, the conversation gets stuck on dashboards and tools instead of the one thing that really matters: truth. Observability isn’t about collecting metrics or building pretty dashboards.

Read Post

Catchpoint

Read more about Observability isn't about the tool. It's about the truth

LangChain Observability: From Zero to Production in 10 Minutes

Jul 3, 2025 By Anjali Udasi In Last9

LangChain apps are powerful, but they’re not easy to monitor. A single request might pass through an LLM, a vector store, external APIs, and a custom chain of tools. And when something slows down or silently fails, debugging is often guesswork. In one instance, a developer ended up with an unexpected $30,000 OpenAI bill, with no visibility into what triggered it. This blog shows how to avoid that using OpenTelemetry and LangSmith. With this setup, you’ll be able to.

Read Post

Last9

Read more about LangChain Observability: From Zero to Production in 10 Minutes

Honeycomb Telemetry Pipeline Demo

Jul 3, 2025 By Honeycomb In Honeycomb

Jessitron takes you through a 3-minute demo of the Honeycomb Telemetry Pipeline. The pipeline makes it easier to manage telemetry so you always have the data you need, when you need it, without trading off cost, control, or visibility.

View Video

Honeycomb

Read more about Honeycomb Telemetry Pipeline Demo

Detecting & Diagnosing Problems, Across Logs, Metrics, & Traces

Jul 2, 2025 By Honeycomb In Honeycomb

What does it look like to notice & debug a problem in Honeycomb? Start with a Service Level Objective (SLO), and Honeycomb can tell you what's unusual about the events that are failing. Continue to dig into all your telemetry.

View Video

Honeycomb

Read more about Detecting & Diagnosing Problems, Across Logs, Metrics, & Traces

Netdata: The Fastest Path to Full Stack Observability. AI Powered.

Jul 2, 2025 By Netdata In netdata

Netdata is a real-time, high-performance, on-premises observability platform designed to monitor metrics and logs with unparalleled efficiency. Netdata requires zero configuration to get started and provides alerts, anomaly detection, and AI-assisted troubleshooting out of the box, for a powerful and comprehensive infrastructure monitoring experience. Netdata is known for its distributed design. Instead of funneling all data into a few central databases like most traditional monitoring solutions, Netdata processes data at the edge, keeping it close to the source.

View Video

netdata

Read more about Netdata: The Fastest Path to Full Stack Observability. AI Powered.

What is eBPF and how can it improve observability? (in 45 seconds)

Jul 2, 2025 By Coroot In Coroot

🐧🐝 Use open source, automatic eBPF observability to gain instant system insights: https://t.ly/qBH9f

#eBPF #Linux #Kubernetes #OTEL #observability #DevOps #SRE

View Video

Coroot

Read more about What is eBPF and how can it improve observability? (in 45 seconds)

Is Your Observability Strategy Boardroom-Ready?

Jul 2, 2025 By Colin Burke In Honeycomb

At LDX3 in London last week, two roundtables I hosted with engineering leaders confirmed what many of us are starting to feel: observability isn’t just important—it’s becoming essential to how modern teams navigate the pressure to move fast and stay resilient.

Read Post

Honeycomb

Read more about Is Your Observability Strategy Boardroom-Ready?

MCP Observability with OpenTelemetry

Jul 2, 2025 By Elizabeth Mathew In SigNoz

2025 has truly been the year of Agentic AI, with MCP (Model Context Protocol) emerging as one of its flashiest and most talked-about innovations. While many products have seamlessly integrated MCP servers into their systems, these servers are increasingly being labelled as black boxes: opaque components that handle critical tasks but offer little visibility into what's happening under the hood. We prompt an agent, a tool gets invoked, and a response is generated. But what really happens in between?

Read Post

SigNoz

Read more about MCP Observability with OpenTelemetry

Can Claude Code Observe Its Own Code?

Jul 1, 2025 By Austin Parker In Honeycomb

One of the great things about OpenTelemetry is that it’s a standard, and standards tend to proliferate. I was excited to see Claude Code add OpenTelemetry metric and log support in a recent release. What was really interesting—beyond the ability to capture usage data from Claude Code—is that you can also get pretty detailed logs about what you’re doing with Claude Code.

Read Post

Honeycomb

Read more about Can Claude Code Observe Its Own Code?

Why GovRAMP-authorized observability matters for state, local, and education IT teams

Jul 1, 2025 By Greg Reeder In Datadog

Building on our FedRAMP Moderate authorization and our “In Process” status for FedRAMP High, Datadog for Government is now "In Process" for GovRAMP High Authorization, giving agencies a unified observability platform that meets the toughest public-sector security bars.

Read Post

Datadog

Read more about Why GovRAMP-authorized observability matters for state, local, and education IT teams
