
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

New Google Cloud Run Visualization in Grafana Cloud | Demo | How to Monitor Google Cloud Run

Perfect for troubleshooting, performance tuning, and cost optimization, this new feature helps you stay in control of your Cloud Run workloads. With this dashboard, you can monitor CPU, memory, network traffic, and active requests at a glance; drill down into individual services and containers with a single click; identify resource usage spikes and optimize performance; and use the Right-Sizing View to find the most resource-heavy services and containers.

How we got abused via OTP

Going through my emails, I saw several about Twilio's auto-recharge, and then something about a suspension. We were using Twilio to send SMS messages and phone call alerts. "That's odd, let me check!" I logged into Twilio from my phone and checked. Horror. Instant horror. The balance was insane. But negative. I told my friend I needed to sit down and check something. Pulled out my laptop and logged in. Same information. Same insane balance. Right there and then I knew it... we'd been abused.

Essential Steps for Troubleshooting Network Problems

Everyone has a story about that one road trip where traffic backed up and made people late to the event. When you have network connectivity problems, your information highway gets clogged in the same way, making it difficult for users to access resources efficiently. While network troubleshooting strategies may seem simple, the work becomes nuanced and complex once you dig into your data.

Simplifying Multi-Node Setups with InfluxDB 3 Enterprise Modes

As your time series data grows, managing increasing workloads can quickly become a headache. High data ingestion rates, numerous (and complex) queries, intensive processing tasks, and routine maintenance like data compaction often compete for limited resources. This leads to unpredictable performance and slower response times, and common solutions often introduce operational complexity.

Email Marketing and Website Downtime: How to Ensure Landing Pages Are Always Accessible

You know how important it is to keep your business available around the clock, especially if you operate across different time zones. For online businesses, marketing and sales never stop: chatbots, AI assistants, and server redundancy keep them serving consumers 24/7.

Investigating an '[Object] not found' error in Next.js with Tracing in Sentry

Breakpoints and console.log statements might save your sanity during local dev, but production issues are another story. In prod, your errors might be distributed across different microservices, or hidden in minified code. Good luck hunting those down. That’s where Sentry’s traces and spans come in, offering you easy visibility into every network request, API call, DB fetch and more in a full-stack, distributed environment.

Why Do You Need a Redis Monitor in Place?

Redis Monitor (the MONITOR command, typically run from redis-cli) is a simple yet powerful debugging tool that displays every command processed by a Redis server in real time. It provides visibility into exactly what’s happening inside a Redis instance as it happens. Running this single command can uncover hidden performance issues. For example, the output might reveal thousands of unexpected HGETALL operations on a key that should be accessed infrequently, exposing a Redis call inside a loop that puts unnecessary strain on the database.
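
One way to spot that kind of hot key is to tally captured MONITOR output by command and key. The sketch below is only an illustration: it assumes the usual line layout of timestamp, bracketed database and client address, then quoted arguments, and the helper name `count_commands` and the sample lines are hypothetical rather than taken from the article.

```python
import re
from collections import Counter

# Assumed MONITOR line layout: <timestamp> [<db> <client>] "<cmd>" "<arg>" ...
LINE_RE = re.compile(r'^(?P<ts>\d+\.\d+) \[(?P<db>\d+) (?P<client>[^\]]+)\] (?P<args>.*)$')
# A quoted argument, allowing backslash-escaped characters inside the quotes.
ARG_RE = re.compile(r'"((?:[^"\\]|\\.)*)"')


def count_commands(lines):
    """Count (COMMAND, first argument) pairs in captured MONITOR output."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip the initial "OK" and anything that is not a command line
        args = ARG_RE.findall(match.group("args"))
        if not args:
            continue
        command = args[0].upper()
        key = args[1] if len(args) > 1 else ""
        counts[(command, key)] += 1
    return counts


# Hypothetical sample, made up for illustration.
SAMPLE = [
    "OK",
    '1700000000.000001 [0 127.0.0.1:50001] "HGETALL" "user:42"',
    '1700000000.000950 [0 127.0.0.1:50001] "hgetall" "user:42"',
    '1700000000.001800 [0 127.0.0.1:50001] "GET" "session:abc"',
    '1700000000.002700 [0 127.0.0.1:50001] "HGETALL" "user:42"',
]

for (command, key), hits in count_commands(SAMPLE).most_common(2):
    print(f"{hits:>4}  {command} {key}")
```

In practice the lines would come from a short capture on a non-critical instance, since MONITOR itself adds load to the server while it is running.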

When Should You Enable Trace-Level Logging?

There’s nothing like debugging a broken system at 2 AM, running on caffeine and frustration. When everything’s on fire, logs are your lifeline. That’s where trace-level logging comes in. Unlike standard logs, it captures the step-by-step execution of your code—think of it as the difference between a crime report and full CCTV footage. But more logs don’t always mean better debugging. Too much detail, and you’re drowning; too little, and you’re guessing.
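
As a rough sketch of enabling trace-level output only when it is needed, the example below uses Python's standard `logging` module. That module has no built-in TRACE level, so the numeric value 5 and the names `make_logger` and `apply_discount` are assumptions made for this illustration.

```python
import logging

# Assumption: the standard library defines no TRACE level, so register one below DEBUG (10).
TRACE = 5
logging.addLevelName(TRACE, "TRACE")


def make_logger(name, trace_enabled):
    """Return a logger that emits TRACE records only when explicitly enabled."""
    logger = logging.getLogger(name)
    logger.setLevel(TRACE if trace_enabled else logging.INFO)
    return logger


def apply_discount(logger, price, percent):
    """Hypothetical business function with step-by-step trace records."""
    logger.log(TRACE, "apply_discount called with price=%s percent=%s", price, percent)
    discount = price * percent / 100
    logger.log(TRACE, "computed discount=%s", discount)
    total = price - discount
    logger.info("discount applied, total=%s", total)
    return total


if __name__ == "__main__":
    logging.basicConfig(level=TRACE, format="%(levelname)s %(name)s: %(message)s")
    # Flip trace_enabled to False to keep only the INFO summary line.
    apply_discount(make_logger("billing", trace_enabled=True), 200, 10)
```

Driving `trace_enabled` from a flag or environment variable keeps the detailed records available during an incident without paying for them in normal operation.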