Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Monitor groups are now supported in the API

We recently launched monitor groups, making it easier to organize monitors on your boards and status pages. Now that same functionality is available in the StatusGator API, so you can manage monitor groups programmatically. The API now supports listing, creating, updating, and deleting monitor groups on a board. You can also assign or remove monitors from groups when creating or updating a monitor.

Best DNS Monitoring Tools in 2026

DNS monitoring is the practice of continuously checking that your domain names resolve correctly (right records, right answers) and that DNS lookups are fast and reliable from multiple locations. Depending on the tool, it can also watch for unexpected DNS record changes (A/AAAA/CNAME/MX/NS/TXT, etc.), validate DNSSEC, and pinpoint where resolution breaks in the chain.

API Uptime Monitoring Explained: How to Measure True API Availability in Production

For many teams, API uptime monitoring still means one simple thing: checking whether an endpoint responds with a 200 OK. If the check passes, the API is marked as “up.” If it fails, an alert is triggered. On paper, that sounds reasonable. In practice, it’s one of the most common reasons API outages go unnoticed until users complain. The problem is that modern APIs are no longer simple, stateless endpoints.

Uptime.com Real User Monitoring Report

Take an in-depth tour of the Uptime.com RUM report. Comprehensively understand your users – and your baselines. Organize RUM data by URL(s) or group URL(s) to track subdomains; segment data by devices, operating systems, browsers, countries, other geographies – to compare metrics within specific time windows to your website or application’s performance monitoring baselines.

Building a synthetic monitoring solution for Jaeger with Grafana k6

Wilfried Roset is an engineering manager who leads an SRE team and he is a Grafana Champion. Wilfried currently works at OVHcloud where he focuses on prioritizing sustainability, resilience, and industrialization to guarantee customer satisfaction. As an SRE Engineering Manager and a Grafana Champion, I believe a resilient and sustainable cloud experience begins with strong observability.

AI Is Bigger Than LLMs: Why Network Teams Need to Think Beyond Chatbots and Agents

AI in network operations is more than chatbots and agents. LLMs make AI easier to use, but the real value comes from the underlying system of telemetry, data pipelines, analytics, ML models, domain knowledge, and workflows that help engineers reason, predict, and act. When designed thoughtfully, AI doesn’t replace engineers. Instead, it augments their expertise and reduces cognitive load across complex network operations.

Heartbeat behind the metrics | Jasper on why availability will never stop mattering

What does it take to build a monitoring platform that teams rely on every single day? In this episode of Heartbeat Behind the Metrics, Jamesraj Paul Jasper, Principal Product Manager of Site24x7, talks about his 15-year journey with the product and the moments that still stand out. He dives into why APM Insights is closest to his heart, and also shares a proud team moment where a complex enterprise feature was designed, built, and shipped in just two weeks through tight coordination.