
The latest news and information on observability for complex systems and related technologies.

Observability and AI explained - Grafana Labs

Jul 31, 2026 By Grafana In Grafana

Senior Developer Advocate Nicole van der Hoeven explains how we're thinking about observability and AI at Grafana Labs. She talks about how you can use AI with observability across the entire SDLC and the different tools you can use for both AI for observability and observability for AI.

View Video

Agent Observability Deep Dive Demo | Grafana Cloud

Jul 31, 2026 By Grafana In Grafana

Grafana AI Observability is our new database and platform for observing AI agents. Over the past year at Grafana Labs, we built agents and needed a way to understand how they perform, what they cost, what their error rate and time to first token are, and how they behave. Grafana Staff Engineer Ivana Hučková provides a deep-dive demo of how Grafana AI Observability connects our experience building agents with our experience building observability systems.

View Video

SaaS vs Self-Hosted Observability (Including BYOC): Cost, Latency, Control and Governance

Jul 31, 2026 By Mohana Ayeswariya J In Atatus

Most teams don't choose an observability deployment model; they inherit one. Someone signed up for a SaaS trial three years ago, telemetry volume grew 40x, and now Finance is asking why the observability line item is the fourth-largest infrastructure cost on the P&L. Or the opposite: a self-hosted Prometheus and Elasticsearch stack was stood up when the company had twelve services, and now it needs a team of two just to keep it upright while everyone else waits on slow dashboards during incidents.

Read Post

Comprehensive Postgres observability: is your database healthy, and will it stay that way?

Jul 31, 2026 By Nikolay Sivko In Coroot

A Postgres database is one of the most complex pieces of software you run. It stores your data durably so a power loss doesn't lose a committed transaction. It hands out real transactions while thousands of queries fight over the same rows. It stays available when a node dies, lets you rewind to a point in time after someone drops the wrong table, and constantly tunes itself with background jobs. Two questions matter about all of that machinery: is it healthy right now, and will it stay healthy?

Read Post

How to build a trust platform for your agent with Grafana Agent Observability

Jul 30, 2026 By Jack Gordley In Grafana

Observing fast-growing agentic workloads is no small feat, especially if you try to build your own monitoring stack or rely solely on tools built for a time before LLMs. At Grafana Labs, we know this all too well.

Read Post

Reflections on AI Week, and the future of solving problems with observability and AI

Jul 30, 2026 By Mat Ryer In Grafana

Thank you for spending AI Week with us. We’re thrilled by the reaction and we all enjoyed replying to your questions. Thanks for engaging.

Read Post

Honeycomb Named a Visionary in the 2026 Gartner Magic Quadrant for Observability Platforms

Jul 29, 2026 By Shabih Syed In Honeycomb

For the third consecutive year, Honeycomb has been recognized for its Ability to Execute and Completeness of Vision and, we believe, for its strong vision around fast, flexible, high-cardinality querying that helps engineers understand not just that something broke, but why. The software development lifecycle has collapsed. The neat sequence of plan, build, test, and ship that teams have relied on for 20 years is now happening in a single afternoon. AI writes a large share of the code.

Read Post

What Comes Next: Honeycomb Demos at O11yCon 2026

Jul 29, 2026 By Honeycomb In Honeycomb

In this session at O11yCon, Purvi Kanal, Jamie Danielson, and Martin Holman demoed the new Canvas. Canvas now understands OpenTelemetry GenAI semantic conventions, and can show agent invocations, LLM calls, and tool calls all in one trace view. Humans and agents can work in the same place, with skills that let each team encode their own expertise so both the agent and their colleagues can use it. Multiplayer support means you can see your teammates' cursors and share charts.

View Video

Reproducing split brain on CloudNativePG

Jul 29, 2026 By Nikolay Sivko In Coroot

We run Postgres under an operator for automatic failover. That is a promise about what happens during a failure, so the only way to know you have it is to cause the failure and watch. The docs tell you what should happen. A config review tells you which knobs are set. Neither tells you how long an isolated primary keeps accepting writes after its replacement has been promoted, and that number decides whether a failover is clean or leaves you with two versions of your data.

Read Post

The Gremlin app for Dynatrace: resilience testing and reliability scoring, built on the observability you already trust

Jul 28, 2026 By Ryan Detwiller In Gremlin

Dynatrace gives engineering teams deep, real-time visibility into every service they run. That visibility is the foundation of every effective reliability practice, and it's exactly the foundation Gremlin is built to extend. Once you can see how your distributed systems behave today, the next step is learning how they'll behave under failure tomorrow, before those failures happen.

Read Post