
A smarter filter for Grafana Alerting: Introducing a new way to find your alerts

At Grafana Labs, we believe that effective alerting is the cornerstone of any robust observability strategy. That’s why we’re constantly listening to your feedback and working to improve the Grafana user experience so it’s easier for you to manage and interact with your alert rules. Today, we’re excited to tell you about an update in Grafana Alerting that’s built to address some of your biggest pain points.

Logs vs. Metrics: Why You Need Both for Observability

Picture this: Your dashboards are calm. CPU load is steady. Error rates are low. Everything looks fine. That is, until the alarms go off. Now what? Metrics tell you something’s wrong, but not what, where, or why. They reveal symptoms, not root causes, and in high-stakes environments, that’s only half the story. Say your API response times spike. Metrics raise the flag, but they don’t tell you if it’s a code deployment, a database hang, or a traffic surge.

Visually identify observability gaps with Cloudcraft in Datadog

Modern cloud environments are highly complex and dynamic, with critical services relying on large numbers of ephemeral resources. Ensuring observability coverage across this landscape is essential for troubleshooting, maintaining reliability, optimizing performance, and enforcing security standards. But as environments grow more elaborate and their ownership more dispersed, tracking observability coverage becomes increasingly challenging.

Monitor Windows Certificate Store with Datadog

The Windows Certificate Store is a critical component of any modern Windows environment. Certificates enable TLS encryption for Internet Information Services (IIS)-hosted applications, support certificate-based authentication in Active Directory, and help validate the identity of trusted Windows services. But if a certificate in your store expires, is revoked, or is part of a broken certificate chain, you risk instability and security gaps in your Windows environment.

Unlocking insights: Introducing Step Metrics for Bitbucket Pipelines

We’re excited to announce step metrics – a new capability coming to Bitbucket Pipelines to help you better manage and optimise your CI/CD workflows. Ever wondered what’s happening under the hood during your pipeline runs? Step metrics provide a window into the resource usage of your build and service containers. More specifically, step metrics let you monitor CPU and memory usage for each build and service container in your pipeline steps.

What is the Internet Stack... and why should you care?

We talk a lot about the application stack, the code and services you build. However, just as critical is the infrastructure that delivers that code to your users. That’s the Internet Stack: a complex chain of technologies and services, from DNS and BGP to CDNs and ISPs, that every digital experience depends on. It’s separate from your application stack. It’s different for every user, in every geography. And most importantly, it still impacts your users—even if you don’t directly own it.

Query Builder v5 - Two Years of Technical Debt, 80 Closed Issues, and a Fundamental Rethinking

In 2022, we had three different query interfaces. Logs had a custom search syntax with no autocomplete. Traces only had predefined filters - no query builder at all. Metrics had a raw PromQL input box where you'd paste queries from somewhere else and hope they worked. Each system spoke a different language. An engineer debugging a production issue had to context-switch not just between data types, but between entirely different mental models of how to query data.

If AI isn't Driving Growth, CX, and Innovation, You're Doing It Wrong

While headlines celebrate each new breakthrough in AI capabilities, businesses are quietly mastering a different art: deploying focused AI solutions that target specific operational challenges. This shift changes everything. We're moving from generic AI that tries to do everything to, as Gartner says, an ROI-driven implementation that does exactly what your business needs. The future of AI's successful adoption lies in smarter applications that solve real enterprise problems.

Debugging and logging in Laravel applications

Logic errors, failed HTTP requests, background jobs that ghost silently: software breaks in all kinds of fun ways. The difference between resilient systems and fragile ones isn’t about avoiding errors altogether. It’s about how quickly and clearly you can see what went wrong and fix it. Laravel gives you a solid foundation: structured logging, real-time introspection, and built-in performance monitoring.