Operations | Monitoring | ITSM | DevOps | Cloud

OpenTelemetry UI: The Ultimate Guide for Developers

If you’ve ever struggled with understanding distributed traces, managing metrics, or debugging complex applications, OpenTelemetry is your best friend. But what about the OpenTelemetry UI? How do you visualize and interact with all that telemetry data? In this guide, we’ll explore the best ways to use OpenTelemetry’s UI options, from setting up a proper observability stack to choosing the right front-end visualization tools.

Integrating OpenTelemetry with Grafana for Better Observability

Modern application observability is essential for ensuring system performance, diagnosing issues, and optimizing user experiences. OpenTelemetry (OTel) and Grafana are two key components in achieving end-to-end visibility. OpenTelemetry instruments applications to collect telemetry data, while Grafana visualizes that data so teams can act on it.

An In-Depth Guide to Java Performance Monitoring for SREs

If you've ever had a Java application slow down in production and struggled to pinpoint the cause, you know the pain of performance issues. Java is a powerful, high-level language, but it doesn’t come without challenges—especially when it comes to resource management, garbage collection, and thread handling. This guide will take you through everything you need to know about Java performance monitoring, from key metrics to tools and best practices.

CLM Chowder: Digging Into the Cloud Latency of Azure, Google Cloud, and OCI

CLM Chowder is a new series that highlights notable observations of cloud connectivity surfaced by Kentik’s Cloud Latency Map. In this edition, we look at measurements from Alibaba (China), latency swings from South Africa, and a temporary latency jump from Marseille to Asia.

The next generation of Grafana Mimir: Inside Mimir's redesigned architecture for increased reliability

This year Grafana Mimir — the open source, horizontally scalable, multi-tenant time series database (TSDB) — will celebrate its third anniversary. Over the years, Mimir has become the go-to, Prometheus-compatible metrics backend within the open source community, with 29 maintainers and more than 4.6k GitHub stars. Since introducing Mimir, we’ve worked hard to deliver on our promise of making it the most scalable and performant open source TSDB in the world.

Grafana Drilldown apps: the improved queryless experience formerly known as the Explore apps

When we introduced the Explore apps suite for metrics, logs, traces, and profiles last year at ObservabilityCON 2024, our goal was simple: offer a queryless, point-and-click experience so you can quickly find insights in your observability data—no queries or complicated syntax required. Our commitment to that goal remains unchanged, but we’re excited to announce that the Explore apps have a new name: Grafana Drilldown.

Intelligent Alerting with RapidSpike and ilert Integration

When it comes to website performance and uptime, every second counts. Businesses rely on tools like RapidSpike to monitor their digital presence, ensuring websites and applications run smoothly. However, effective alerting and incident management are just as critical as monitoring itself. That’s where ilert comes in.

DORA Compliance - An Opportunity for MSPs

For Managed Service Providers (MSPs) in the EU that serve financial organizations, DORA regulatory compliance is a hot topic. DORA (the Digital Operational Resilience Act) is an EU regulation that has applied since January 17, 2025. It aims to ensure the operational resilience of financial entities in the EU, focusing on technology risk management and minimizing disruptions to critical services.

New Integration: ilert + RapidSpike for Proactive Website Monitoring

We are pleased to announce a new inbound integration in the ilert catalog: RapidSpike. This integration enhances incident management by connecting ilert with RapidSpike’s website monitoring capabilities, ensuring teams receive real-time alerts on website performance, uptime, and security threats.