Operations | Monitoring | ITSM | DevOps | Cloud

January 2023

The Inevitable - Failures in Distributed Systems

Experiencing failure at scale is as the popular Marvel character Thanos would say “Inevitable”. Memory leaks, software or hardware or network I/O failures are just a few. It’s a problem of simple mathematics, the probability of failing rises as the total number of operations performed increases. With each component used to scale the application, the failure quotient increases. So how do you tackle this so-called “Inevitable” problem that comes with scaling?