Reliability is about more than uptime

Jul 22, 2025

Reliability results are more than whether your application is up, it's about proactive measurement and keeping it up. Find out how to be reliable with Gremlin → https://www.gremlin.com/

Full transcript:

 Reliability results in my earlier career was, "Is there any downtime? Are there any errors that are getting thrown?" It's not a proactive way to measure your reliability.

If you're measuring it in time of production, it's not gonna be an accurate reflection of what your reliability is. The way that my mindset has changed over time has been a proactive measurement. Before we ship something out, is this gonna be reliable from the start?

So we're measuring in our local environment, in our staging environment, and in our production environment. And it's an ongoing effort to make sure that your software engineers have peace of mind, your executive level has peace of mind, your customers have peace of mind.

It's not just about keeping your application on, it's about a culture that everyone is bought into of making sure that your application is doing what it's supposed to be doing at all times.