How to run checks on internal services with Grafana Cloud Synthetic Monitoring

Many critical services run inside private networks, where traditional monitoring tools and practices can’t offer full visibility. This makes it difficult to validate service availability and performance before problems impact your users. Synthetic Monitoring — a Grafana Cloud solution that helps you proactively monitor the performance of your applications and services — addresses this gap with a feature known as private probes.

The KPI Trap in Supply Chain Analytics: How Teams Escape It

KPI dashboards can look like progress, especially when charts move and targets turn green. In many organizations, the dashboard becomes the meeting, the meeting becomes the process, and the process quietly replaces real problem solving. The result is a lot of motion, not much traction, and a team that feels busy without feeling effective.

Build, buy, or open source? Understanding your options with Grafana's AI-powered observability

Some questions in engineering never go away. Here’s one that every team eventually confronts: Do we roll up our sleeves and build the tooling ourselves, or do we buy something built for us? It’s a choice that has the power to speed teams up or hold them back. With the rise of AI-powered observability, this familiar software dilemma has re-emerged with higher stakes and faster-moving technology.

Dashboard organization isn't about folders - it's about visibility

Having well-organized dashboards is just as important as having good dashboards. But dashboard organization shouldn’t just make things easy to find. It should provide structure that supports collaboration and efficient troubleshooting. It has to be more than a basic folder system. This post looks at how classic dashboarding tools handle organization today, where they fall short, and how SquaredUp Workspaces organize for visibility and shared context.

Continuous profiling in production: A real-world example to measure benefits and costs

Continuous profiling offers deep visibility into production environments, revealing exactly how applications consume CPU and memory. It’s the go-to observability practice for directly connecting system behavior and performance to specific lines of code. But when teams consider deploying continuous profiling more broadly, a common question comes up: what’s the overhead? Is it safe to run continuous profiling on my production services 24/7, or does the cost outweigh the benefits?

How we built Grafana Assistant - a conversation about AI development for observability

This conversation with Grafana Labs engineers, Mat Ryer, Cyril Tovena and Sven Großmann, dives deep into the engineering behind Grafana Assistant, exploring how agentic AI is transforming the observability landscape. From hackathon origins to sophisticated backend agents, the team shares candid lessons on building, scaling, and refining AI tools for engineers.

Grafana dashboards as code: How to manage your dashboards with Git

Note: This blog post originally published in May 2025 and was updated in February 2026 to reflect that Git Sync is now available in public preview in Grafana Cloud. As your Grafana instance scales, so does the challenge of maintaining dashboards. Managing dozens—or hundreds—of dashboards through the UI alone can quickly become overwhelming. Tracking changes gets murky, dashboards multiply, and consistency suffers.

Add skills to agents: Use Assistant playbooks for faster answers, investigations

Grafana Assistant is the most general-purpose tool we’ve delivered since dashboards. People use our Grafana Cloud LLM to understand unfamiliar areas of their stacks, generate dashboards and beautiful visualizations out of thin air, build queries, and support investigations.

Custom Dashboard Creation: Step-by-Step Tutorial

Creating a custom dashboard is the best way to monitor metrics that matter most to your systems. Tools like MetricFire make this process straightforward by combining hosted Grafana and Graphite, eliminating the need for self-hosted solutions. Here's how you can build dashboards tailored to your needs.

Observing agentic AI workflows with Grafana Cloud, OpenTelemetry, and the OpenAI Agents SDK

As agentic AI applications are used more broadly in production, they introduce new operational models, combining multi-step reasoning, tool execution, and autonomous decision-making into a single workflow. SRE teams need visibility into how these agents behave, where they fail, and how they perform over time.