Operations | Monitoring | ITSM | DevOps | Cloud

Observability trends in Brazil: insights from our localized survey

Organizations in Brazil are eager to adopt some of the latest observability trends and technologies as they look to keep their software running as smoothly as possible, according to analysis of a micro survey recently conducted by Grafana Labs. Observability is an evolving space, and this is the first time Grafana Labs has run a Brazilian version of our annual Observability Survey.

Introducing Logz.io Open 360 AI: The Next Generation of Observability Is Here

Traditional observability tools can’t keep up with modern complexity. Dashboard and alert-based approaches still rely heavily on manual processes, resulting in longer troubleshooting cycles, slower decisions, and higher MTTR. Engineering teams need something better. Today we’re launching Open 360 AI, the first observability platform designed for both humans and AI agents working together.

How To Use Alloy and Hosted Graphite's Loki to Store and Visualize Logs

In a modern DevOps environment, having just metrics or just logs is like trying to navigate with half a map because you’re missing important context that makes decisions faster and smarter. Metrics tell you what is happening (CPU spikes, request rates, failed logins) but logs tell you why it’s happening, with the timestamps to prove it.

Your APIs are up, but did the payment go through?

If your challenger bank is built on composable core platforms like Mambu or Temenos, this one’s for you. Composable platforms enable API-first integration with modular services, letting you launch, adapt, and grow products quickly. That makes API health a top priority — and it shows in our State of API Reliability Report 2025 (we’ve pulled out the key fintech findings for APAC below).

Learn OpenTelemetry tracing through a grand strategy game: introducing Game of Traces

A trace always remembers! Okay, okay. I will try to keep my Game of Thrones references to a minimum throughout this post, but there is a lot of truth to that statement. In observability, a trace is the “when” and “where” of telemetry signals, allowing us to track the state of interactions between services within a microservice architecture. This makes traces the ideal observability signal for discovering bottlenecks and interconnection issues.

Top 7 Application Performance Monitoring Tools

Your application is under constant pressure to deliver low latency, high reliability, and a smooth user experience isn’t optional. When performance drops, every second matters. Application Performance Monitoring (APM) gives you the visibility to spot issues before your users feel the impact. It also helps you understand what’s happening inside your stack, so you can track resource usage, pinpoint bottlenecks, and keep things running at peak performance.

HTTP status codes? Here's a cheat sheet

Whenever you visit a website or click on a link, there’s a whole conversation happening behind the scenes between your browser and the web server. That conversation includes something called HTTP status codes and knowing what they mean can help you make a diagnosis, so to speak. Usually, everything goes smoothly (like a 200 OK), but sometimes things break (looking at you, 404 and 500).

Best Practices for Managing Multiple Vendor Dependencies

Modern businesses rely on dozens of third-party services to operate efficiently. From payment processors and cloud providers to analytics tools and communication platforms, these vendor dependencies form the backbone of your technology stack. When one fails, it can trigger a cascade of issues across your entire operation. Managing multiple vendor dependencies requires a strategic approach that combines proactive monitoring, clear documentation, and well-defined response procedures.

What Is a Telemetry Pipeline and Why It Matters in Modern IT

A practical guide for IT professionals, DevOps, security teams, platform engineers, and anyone who’s dealing with logs. In contemporary distributed systems, telemetry data—logs, metrics, traces, and events—serves as the primary mechanism for understanding internal system behavior. However, as system complexity increases, so does the volume and heterogeneity of telemetry.