It seems we can’t escape the burden of uncertainty, from the countless unknowns of the Covid-19 pandemic to the current state of the economy, shifting financial circumstances and trends, and the fear of unemployment. For many of today’s businesses, whatever their size, predicting the future can appear nearly impossible. At the start of the pandemic in 2020, IT teams shifted without notice to supporting remote employees.
Picture a simple e-commerce platform with the following components, each generating logs and metrics. Now imagine the on-call engineer responsible for this platform, feet up on a Sunday morning, watching The Lord of the Rings with a coffee, when suddenly the on-call phone starts to ring. Oh no! It’s a customer, reporting that sometimes, maybe a tenth of the time, the web front end returns a generic error as they try to complete a workflow.