
The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

Icinga Experience: Insights from Real-World Icinga Deployments Across Industries

Modern IT environments are hybrid, distributed, and constantly growing. To keep them reliable, organizations rely on monitoring that scales, automates, and integrates seamlessly into existing workflows. We collected 24 Icinga customer stories from industries including finance, telecom, manufacturing, and public services. What unites them is the choice of Icinga as a flexible and cost-efficient alternative to proprietary monitoring tools.

Faster, more memory-efficient performance in Grafana Mimir: a closer look at Mimir Query Engine

Until recently, Grafana Mimir — our open source, horizontally scalable, multi-tenant time series database (TSDB) — has exclusively used Prometheus’ PromQL engine to evaluate queries. While the PromQL engine works great, it sometimes needs a lot of memory to run, specifically in the Mimir querier component. To address this memory consumption issue, we recently introduced Mimir Query Engine (MQE).

What is Asynchronous Job Monitoring?

Modern applications don’t process everything inside the request/response path. To keep APIs responsive, time-consuming work like image resizing, payment processing, or data syncs is moved into background queues. Workers then pick up these asynchronous jobs and run them outside the main thread. Asynchronous job monitoring is the practice of tracking these background tasks. Without this visibility, background workers become a blind spot.
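As a rough illustration of the idea, the sketch below runs jobs on a background worker thread and records the outcome and duration of each one. It is a minimal example using only the Python standard library, not any particular queueing product; the names `JobStats`, `run_jobs`, `resize_image`, and `sync_data` are hypothetical and chosen only for this example.

```python
import queue
import threading
import time
from dataclasses import dataclass, field


@dataclass
class JobStats:
    """Counters a monitor might expose for background jobs (hypothetical)."""
    succeeded: int = 0
    failed: int = 0
    durations: list = field(default_factory=list)
    lock: threading.Lock = field(default_factory=threading.Lock)

    def record(self, ok, seconds):
        # Guard the counters so several workers could share one JobStats.
        with self.lock:
            if ok:
                self.succeeded += 1
            else:
                self.failed += 1
            self.durations.append(seconds)


def worker(jobs, stats):
    """Pull jobs off the queue and record the outcome and duration of each."""
    while True:
        job = jobs.get()
        if job is None:  # sentinel: no more work
            jobs.task_done()
            return
        started = time.monotonic()
        try:
            job()
            ok = True
        except Exception:
            # A failed job is counted instead of silently disappearing.
            ok = False
        stats.record(ok, time.monotonic() - started)
        jobs.task_done()


def run_jobs(job_list):
    """Run every job on one background worker and return the collected stats."""
    jobs = queue.Queue()
    stats = JobStats()
    thread = threading.Thread(target=worker, args=(jobs, stats))
    thread.start()
    for job in job_list:
        jobs.put(job)
    jobs.put(None)
    jobs.join()
    thread.join()
    return stats


def resize_image():
    """Stand-in for slow background work (hypothetical)."""


def sync_data():
    """Stand-in for a job that fails (hypothetical)."""
    raise RuntimeError("upstream unavailable")


stats = run_jobs([resize_image, sync_data, resize_image])
print(stats.succeeded, stats.failed)  # prints "2 1"
```

In a real deployment these counters would typically be exported to a monitoring system and alerted on, so that failed or slow jobs surface without anyone having to inspect the workers by hand.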

SQL performance improvements: finding the right queries to fix (part 1)

A few weeks ago, we massively improved the performance of the dashboard & website by optimizing some of our SQL queries. In this post, we'll share how we identified the queries that needed work. In the next post, we'll explore how we fixed each of them. We'll cover the basics and gradually work our way up to the more advanced ways of identifying slow queries. Let's go!

Making the invisible visible: Are your cloud firewalls and DDoS protection really working?

Every business builds strong defences to keep attackers out. Firewalls and DDoS protection serve that purpose, standing guard over company apps and websites, like knights at the castle gate keeping out trolls (not just the ones on X). But here’s the problem: those defences only work if users actually walk through the front gate. Sometimes, people find hidden paths or side doors around your walls, so the guards never see them enter.

CloudSpend for iOS 26 for sharper, smarter, and simpler cloud cost management

Experience seamless control, clarity, and cost optimization with the CloudSpend app on iOS 26. This update integrates Apple’s new Liquid Glass design and secure, on-device AI summaries to deliver instant insights into your cloud spending, empowering you to act decisively from anywhere.

Logs & Lattes: Episode 1 - Smart Logging Without the Price Trap

How much value are you really getting from your logs, and what are you giving up to stay on budget? In this episode of Logs and Lattes, host Palmer Wallace sits down with Seth Goldhammer, VP of Product Management at Graylog, for a candid conversation about the hidden cost of traditional SIEM pricing. Seth explains how ingest-based and resource-heavy licensing models pressure security teams into tough tradeoffs, such as dropping logs, tuning down detections, or limiting retention just to avoid budget overages.