Operations | Monitoring | ITSM | DevOps | Cloud

Simulate Real User Workflows | Introduction to Grafana Cloud Synthetic Monitoring

Just because your app is up doesn’t mean it’s working. Behind the scenes, users could be facing failed checkouts, broken workflows, or slow page loads — and you may not know until it’s too late. In this video, we’ll show you how Grafana Cloud Synthetic Monitoring helps you proactively simulate real user behavior and monitor the performance of your critical user flows, websites, and APIs from locations around the world — so you can catch issues before your users do.

Azure DevOps agent pools: diving deeper

Most of the time the build and deployment pipelines we create will run on compute provided by the Azure DevOps cloud and the only decision we need to make is whether to select a Windows or Linux Agent. Sometimes though, the specification for the VM that Azure DevOps spins up may not be right for our needs. We may need more memory or a particular OS version. This is when custom agents and Agent Pools come into play.

A simple new way to visualize Prometheus

Even if you don’t work with Prometheus day-to-day, you most likely have heard of it. After Kubernetes was donated to the Cloud Native Computing Foundation (CNCF), Prometheus became the second project to be incubated soon after. That was back in 2016 and it is still one of the most active CNCF projects. Why is it so popular? It’s the de facto monitoring tool for containerized workloads running on-prem and in the cloud – that is, it’s the monitoring tool for Kubernetes.

Getting Started With AWS Dashboards

Being the most popular cloud solution provider, AWS needs no introduction. With its powerful and numerous services and solutions, many companies of all sizes and shapes run their applications and/or infrastructure on AWS. With AWS being integrated with other internal services as well as external solutions hosting the business apps, it is crucial to be aware of what's happening across the landscape and beyond, to ensure business continuity. The AWS plugin for SquaredUp helps you achieve exactly that.

How to keep Ingress NGINX Controller metric volumes manageable and still meaningful

The Ingress NGINX Controller is a widely used Kubernetes component for managing HTTP and HTTPS traffic routing. While it provides powerful observability through Prometheus metrics, it’s also notorious for generating an excessively high number of time series. The root cause lies in how the controller labels its metrics—tracking requests across multiple dimensions such as ingress name, host, path, status code, and upstream response times.

Why Observability is Getting Expensive and OpenTelemetry is Becoming More Popular | Grafana Labs

Grafana Labs' Jen Villa shares the latest insights into how organizations are rethinking their observability strategies — with cost now taking center stage. This video covers: Chapters: Grafana Cloud is the easiest way to get started with Grafana dashboards, metrics, logs, and traces. Our forever-free tier includes access to 10k metrics, 50GB logs, 50GB traces and more. We also have plans for every use case.

Grafana Campfire - Data Visualization Tips and Best Practices (Grafana Community Call- April 2025)

Creating dashboards in Grafana gives you some very good built-in features to manipulate your data by using transformations, variables, filtering, overriding,, annotations for your data and with addition of Community plugins (data sources, panels and apps) increases the user experience to a whole next level. Still, many users do not either know about these features or do not use it correctly and why is the reason?

Everything You Need to Know About OpenTelemetry Histograms

Modern systems throw off a lot of data—metrics, traces, logs—sometimes more than we know what to do with. When you're trying to understand how values spread out over time (like response times, memory usage, or queue lengths), averages alone don’t tell the full story. OpenTelemetry histograms help fill in those gaps. This guide walks through what they are, why they matter, and how DevOps engineers can use them to improve observability in real systems.

All about OTel and Logging on Kubernetes with Loki (Loki Community Call April 2025)

In this pre-recorded Loki Community Call, we talk all about OTel and logging on Kubernetes with Cyril Tovena, Ward Bekker, Jay Clifford, and Nicole van der Hoeven at KubeCon EU 2025 in London. We discuss when why you should switch to OTel and why you shouldn't, what OTLP is exactly, and best practices for ingesting data through an OTLP endpoint.