Observability for containerized workloads: How to run Grafana Beyla as a sidecar in Amazon ECS

Note: Grafana Beyla has been donated to OpenTelemetry under the new project name OpenTelemetry eBPF Instrumentation. Beyla will continue to exist as Grafana Labs’ distribution of the upstream project. Grafana Beyla is an open source eBPF-based auto-instrumentation tool that helps you easily get started with application observability, allowing you to monitor and visualize traces without modifying the application code.

Getting started with VMware dashboards

VMware is a leading platform for virtualization and cloud infrastructure, widely used to manage compute, storage, and networking resources across on-premises and hybrid environments. While it offers powerful capabilities and extensive telemetry through tools like vCenter, navigating this data can be overwhelming – especially when trying to spot performance issues, capacity trends, or VM sprawl in real time. That’s where a solution like SquaredUp can make a significant difference.

Customizing your Azure DevOps DORA metrics dashboard

Looking to configure and customize a DORA metrics dashboard? Our Director of Engineering Services, Tim Wheeler, demonstrates how to customize the DORA Metrics dashboard in Azure DevOps for SquaredUp. He shows how to populate key metrics like deployment frequency and change failure rate by selecting a pipeline, specifically the SquaredUp multi-stage pipeline.

Observability in under 5 seconds: Reflecting on a year of grafana/otel-lgtm

With grafana/otel-lgtm, observability is just one Docker command away. Over the past year, grafana/otel-lgtm has simplified observability setups, helping developers get a complete OpenTelemetry stack running in under five seconds. With integrations for metrics, logs, traces, and now profiles via Grafana Pyroscope, it has become a go-to solution for demos, development, and testing, as evidenced by its community (1k stars on GitHub and growing!) and notable adopters.

From chaos to clarity with Grafana dashboards: How video game company EA monitors 200+ metrics

To be a successful gamer, you have to think strategically and creatively. Working as a software engineer at Electronic Arts (EA), a top video game company, requires the same skills. That’s especially true when it comes to monitoring the EA app, which is the launcher for EA games and used by hundreds of millions of people around the world.

InfluxDB 3 Core: a complete rewrite designed for speed and simplicity

InfluxDB has been a popular time series database for the better part of a decade. The latest release represents years of behind-the-scenes work to deliver several major features that users have been requesting since the database's earliest days.

Faster incident response through distributed tracing: Inside Glovo's use of Traces Drilldown

It’s almost 1 p.m. on a Monday afternoon and you’re hungry. You pull up your meal delivery app and select your favorite restaurant and dish. Then you go to check out and nothing happens. Your frustration mounts as you get hungrier by the minute. But there’s frustration on the other side of that transaction as well—engineers are scrambling to figure out what’s wrong as orders drop and revenue losses rise.

Prometheus Gauges vs Counters: What to Use and When

Choosing the wrong metric type in Prometheus can lead to inaccurate dashboards, false positives in alerting, and missed indicators of system failure. Gauge metrics are intended for tracking values that can go up and down, such as memory usage, queue depth, or the number of active connections. Unlike counters, which only increment (or reset on restart), gauges reflect the current state of a resource at scrape time.
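
To make the distinction concrete, here is a minimal sketch in plain Python. It is a simplified model of the two metric types, not the Prometheus client library API: the `Counter` and `Gauge` classes and the `counter_increase` helper are hypothetical names introduced for illustration. The helper assumes that any drop between two scraped counter samples means the process restarted and the counter began again from zero.

```python
class Counter:
    """Monotonic metric: the value may only go up (it restarts at 0 with the process)."""

    def __init__(self):
        self.value = 0.0

    def inc(self, amount=1.0):
        if amount < 0:
            raise ValueError("counters can only increase")
        self.value += amount


class Gauge:
    """Point-in-time metric: the value may go up or down freely."""

    def __init__(self):
        self.value = 0.0

    def set(self, value):
        self.value = float(value)

    def inc(self, amount=1.0):
        self.value += amount

    def dec(self, amount=1.0):
        self.value -= amount


def counter_increase(samples):
    """Total increase across scraped counter samples.

    A drop between consecutive samples is treated as a reset to zero,
    so the whole new value counts as increase since the restart.
    """
    total = 0.0
    for prev, cur in zip(samples, samples[1:]):
        total += (cur - prev) if cur >= prev else cur
    return total


# Gauge: active connections rise and fall; only the current value matters.
connections = Gauge()
connections.set(5)
connections.inc(2)
connections.dec(4)

# Counter: total requests served; scrapes show a restart between 15 and 3.
scraped_requests = [10, 15, 3, 8]
requests_served = counter_increase(scraped_requests)
```

In this sketch the gauge ends at 3.0, while the counter samples yield a total increase of 13.0 (5 before the restart, then 3 and 5 after it). Reading the counter's raw last value (8) would understate the work done, which is why counters are usually queried as an increase or rate rather than as a current value.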