Alerting

Then and Now: Distributed Systems Alerting and Monitoring

Distributed systems are everywhere. Although many teams don’t think of their applications as distributed systems, if they’re developing using container-based microservices and serverless functions instead of a monolith, they’re creating a distributed system. This change also means that monitoring needs are becoming more complex.

From Metrics to Valuable Insights: Incident Post-Mortem Reports

IT organizations, such as managed service providers (MSPs), deploy incident alerting and on-call management solutions to accelerate software delivery and ensure seamless customer experiences. Incident alert management platforms orchestrate the distribution of alerts to ensure that technicians continue to maintain system uptime and minimize service disruptions.

Troubleshooting Outages at 3 AM with Alert Response

Imagine you are an on-call engineer who receives an alert at 3 AM informing you that customers are experiencing high latency on your website and are unable to shop. As an incident response coordinator at Sumo Logic myself, I can tell you that I don’t envy that engineer. If this alert fired, this is what would likely follow: The biggest challenge is how to gather this information quickly, so you can decide whether to jump out of bed or go back to sleep.

Automate, Group, and Get Alerted: A Best Practices Guide to Monitoring your Code - Part 1

As companies grow, so do their products, teams, and the number of external tools. For engineers, that can mean code sprawl, data silos, notification fatigue, and some “what the…?” moments along the way as they try to make sense of it all.

What Exactly is a Website Monitoring "False Alarm" and Why You Should Care About It

You know what falsehoods are. You know what false teeth are. You may even know some falsehoods about false teeth. But do you know what a website monitoring false alarm (also known as a “false positive”) is? If not, then please keep reading to find out, because it’s a very big deal.

Android App Update: Mute and enhanced 'Do not disturb' override

With our latest Android app update (3.1., build 242), you will never miss a critical SIGNL4 alert again. Even if your phone is muted or in do-not-disturb mode, SIGNL4 can now make a lot of ‘noise’ and wake you up reliably when a major or critical incident occurs. Here is how it works…