The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Introducing Checkly Uptime Monitoring: A Fast and Affordable Way to Detect Infrastructure Downtime

Learn more about Checkly, the application reliability platform designed for modern engineering teams! Discover how Checkly enables you to quickly detect, communicate, and resolve production issues and explore the newly added uptime monitoring features, including URL, TCP, and heartbeat monitors. Configure and manage your entire monitoring setup using monitoring as code!

Introducing Logz.io Open 360 AI: The Next Generation of Observability Is Here

Traditional observability tools can’t keep up with modern complexity. Dashboard and alert-based approaches still rely heavily on manual processes, resulting in longer troubleshooting cycles, slower decisions, and higher MTTR. Engineering teams need something better. Today we’re launching Open 360 AI, the first observability platform designed for both humans and AI agents working together.

How To Use Alloy and Hosted Graphite's Loki to Store and Visualize Logs

In a modern DevOps environment, having just metrics or just logs is like trying to navigate with half a map because you’re missing important context that makes decisions faster and smarter. Metrics tell you what is happening (CPU spikes, request rates, failed logins) but logs tell you why it’s happening, with the timestamps to prove it.

Your APIs are up, but did the payment go through?

If your challenger bank is built on composable core platforms like Mambu or Temenos, this one’s for you. Composable platforms enable API-first integration with modular services, letting you launch, adapt, and grow products quickly. That makes API health a top priority — and it shows in our State of API Reliability Report 2025 (we’ve pulled out the key fintech findings for APAC below).

Learn OpenTelemetry tracing through a grand strategy game: introducing Game of Traces

A trace always remembers! Okay, okay. I will try to keep my Game of Thrones references to a minimum throughout this post, but there is a lot of truth to that statement. In observability, a trace is the “when” and “where” of telemetry signals, allowing us to track the state of interactions between services within a microservice architecture. This makes traces the ideal observability signal for discovering bottlenecks and interconnection issues.

Top 7 Application Performance Monitoring Tools

Your application is under constant pressure to deliver: low latency, high reliability, and a smooth user experience aren’t optional. When performance drops, every second matters. Application Performance Monitoring (APM) gives you the visibility to spot issues before your users feel the impact. It also helps you understand what’s happening inside your stack, so you can track resource usage, pinpoint bottlenecks, and keep things running at peak performance.

HTTP status codes? Here's a cheat sheet

Whenever you visit a website or click on a link, there’s a whole conversation happening behind the scenes between your browser and the web server. That conversation includes something called HTTP status codes, and knowing what they mean can help you make a diagnosis, so to speak. Usually, everything goes smoothly (like a 200 OK), but sometimes things break (looking at you, 404 and 500).
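
To make the idea concrete, here is a minimal sketch of how a status code could be turned into a readable label using Python's standard library. The helper name `describe_status` and the wording of the category labels are assumptions for illustration and do not come from the cheat sheet itself.

```python
from http import HTTPStatus

# Hypothetical helper: maps the first digit of a status code to its class.
STATUS_CLASSES = {
    1: "informational",
    2: "success",
    3: "redirection",
    4: "client error",
    5: "server error",
}


def describe_status(code: int) -> str:
    """Return a label such as '404 Not Found (client error)'."""
    try:
        # HTTPStatus raises ValueError for codes it does not know.
        phrase = HTTPStatus(code).phrase
    except ValueError:
        phrase = "Unknown"
    category = STATUS_CLASSES.get(code // 100, "unknown class")
    return f"{code} {phrase} ({category})"


if __name__ == "__main__":
    for code in (200, 404, 500):
        print(describe_status(code))
```

The first digit alone already says who is likely at fault: a 4xx code points to the request, while a 5xx code points to the server.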

Best Practices for Managing Multiple Vendor Dependencies

Modern businesses rely on dozens of third-party services to operate efficiently. From payment processors and cloud providers to analytics tools and communication platforms, these vendor dependencies form the backbone of your technology stack. When one fails, it can trigger a cascade of issues across your entire operation. Managing multiple vendor dependencies requires a strategic approach that combines proactive monitoring, clear documentation, and well-defined response procedures.