May 2024

How reliability differs between monolithic and microservice-based architectures

May 14, 2024 By Andre Newman In Gremlin

Microservices have forever changed the way we build applications. Tools like Docker and Kubernetes made microservice-based architectures widely accessible to software developers, and cloud platforms like Amazon EKS made deploying containers fast and inexpensive. They've also enabled even small engineering teams to deploy code faster, leverage fault tolerance and redundancy, scale more efficiently, and take full ownership of their services from development all the way into production.

How to run Chaos Engineering experiments in your CI/CD pipeline

May 10, 2024 By Gremlin In Gremlin

Part of the Gremlin Office Hours series: a monthly deep dive with Gremlin experts. Ad hoc Chaos Engineering experiments are great for learning more about how your systems work, but they don’t tell you how your systems behave over time. As new features get deployed, environments change, and regressions get introduced, even the most resilient systems can accumulate reliability risks. QA and performance testing are already built into CI/CD, so why not reliability testing?

How to build zone-redundant cloud instances and clusters

May 9, 2024 By Andre Newman In Gremlin

Redundancy is a core tenet of cloud computing. While major cloud platforms have high targets for reliability, they can still fail, and it’s important for teams to have a plan for when they do. But how can you build services that can withstand something as disruptive as a datacenter outage? In this blog, we’ll show you how to prepare for availability zone outages by proactively detecting services operating in a single zone.

Five ways Gremlin helps organizations meet DORA requirements

May 7, 2024 By Ryan Detwiller In Gremlin

Enacted by the European Union, the Digital Operational Resilience Act (DORA) establishes new standards for digital operational resilience in the financial sector. DORA changes the financial sector's approach to digital security and resilience by imposing stringent requirements for Information and Communication Technology (ICT) risk management, incident reporting, third-party risk management, and regular testing.

Three roles you need for reliability success

May 7, 2024 By Gavin Cahill In Gremlin

It’s one thing to say that reliability is a priority for your organization, and a whole other thing to make actual, demonstrable improvements in the availability of your applications. Sadly, it’s common for organizations to invest time, money, and effort into improving reliability only to barely nudge the needle on incidents and downtime. But there are hundreds of companies successfully improving their reliability posture—and doing it at enterprise scale.

How to build reliable services with unreliable dependencies

May 2, 2024 By Andre Newman In Gremlin

In an earlier blog, we looked at slow dependencies and how they can impact the reliability of other services. While we explored what happens when dependencies are degraded, what happens when dependencies outright fail? What can you do when your application or service sends a request to another service, and nothing comes back? We’ll answer this question by using Gremlin to proactively test a service with multiple dependencies.
