Monthly Archive

How to define and measure the reliability of a service

Jul 14, 2022 By Andre Newman In Gremlin

More and more teams are moving away from monolithic applications and towards microservice-based architectures. As part of this transition, development teams are taking more direct ownership over their applications, including their deployment and operation in production. A major challenge these teams face isn't in getting their code into production (we have containers to thank for that), but in making sure their services are reliable.

Read Post

Gremlin

Read more about How to define and measure the reliability of a service

How Gremlin's reliability score works

Jul 14, 2022 By Andre Newman In Gremlin

In order to make reliability improvements tangible, there needs to be a way to quantify and track the reliability of systems and services in a meaningful way. This "reliability score" should indicate at a glance how likely a service is to withstand real-world causes of failure without having to wait for an incident to happen first. Gremlin's upcoming feature allows you to do just that.

Read Post

Gremlin

Read more about How Gremlin's reliability score works

Chaos Engineering Tools: Build vs Buy

Jul 8, 2022 By Gremlin In Gremlin

Chaos Engineering, where engineers intentionally inject failure to test the reliability of their systems, is becoming a regular practice for companies who value uptime and availability. As cloud-based systems have grown more complex, Chaos Engineering has become a critical part of the software testing and release process to uncover surprise dependencies, fix problems before they become 3am outages, and bake reliability into every feature.

Read Post

Gremlin

Read more about Chaos Engineering Tools: Build vs Buy

Operations | Monitoring | ITSM | DevOps | Cloud

How to define and measure the reliability of a service

How Gremlin's reliability score works

Chaos Engineering Tools: Build vs Buy

Monthly Archive

Follow Us