
Announcing Services Discovery for tracking and improving service reliability

Apr 27, 2021 By Matt Schillerstrom In Gremlin

Gremlin helps teams proactively improve the reliability of their systems by running chaos experiments on infrastructure, including hosts, containers, and Kubernetes clusters. But as microservice-based architectures and automated cloud platforms become the norm, engineers are shifting their focus from managing infrastructure to managing services. To keep these services as resilient as possible, they need tools that help them find failure modes, reduce incidents, and improve availability.

Read Post

Chaos Engineering in 60 seconds - Attack a service

Apr 27, 2021 By Gremlin In Gremlin

Learn how to run a chaos experiment on a distributed service using Services Discovery in Gremlin. Gremlin is the enterprise Chaos Engineering platform on a mission to help build a more reliable internet. Their solutions turn failure into resilience by offering engineers a fully hosted SaaS platform to safely experiment on complex systems, in order to identify weaknesses before they impact customers and cause revenue loss.

View Video

Announcing Services Discovery for tracking and improving service reliability

Apr 27, 2021 By Gremlin In Gremlin

Gremlin announces Services Discovery for tracking and improving the reliability of distributed services.

View Video

Announcing role based access control for API keys for more control over automation

Apr 22, 2021 By Matt Schillerstrom In Gremlin

Today, Gremlin is excited to announce the ability to create an API key that can perform actions with the same set of permissions as your user account. This allows you to automate Gremlin tasks safely and securely.

Read Post

Creating Chaos to Achieve Reliability

Apr 22, 2021 By JJ Tang In Rootly

How can creating chaos achieve better reliability? Chaos and reliability might seem mutually exclusive, but through the use of Chaos Engineering, SREs can bring about meaningful changes to system resiliency.

Read Post

How Netflix Uses Fault Injection To Truly Understand Their Resilience

Apr 6, 2021 By Thomas Russell In Coralogix

Distributed systems such as microservices have defined software engineering over the last decade. Most of the advances have been in resilience, flexibility, and speed of deployment at ever larger scales. For streaming giant Netflix, the migration to a complex cloud-based microservices architecture would not have been possible without a revolutionary testing method known as fault injection. With tools like Chaos Monkey, Netflix employs a cutting-edge testing toolkit.

Read Post

Announcing our latest attacks to deal with meeting fatigue

Apr 1, 2021 By Gremlin In Gremlin

With everyone working remotely, video conference tools like Zoom have been a critical part of maintaining business continuity. It’s truly amazing that we can continue to work and connect with one another, even during a time when getting together in an office hasn’t been possible…

Read Post

Validating the resilience of your API gateway with Chaos Engineering

Mar 4, 2021 By Andre Newman In Gremlin

API gateways are a critical component of distributed systems and cloud-native deployments. They perform many important functions including request routing, caching, user authentication, rate limiting, and metrics collection. However, this means that any failures in your API gateway can put your entire deployment at risk.

Read Post

What is fault injection?

Feb 16, 2021 By Andre Newman In Gremlin

When reading about Chaos Engineering, you’ll likely hear the terms “fault injection” or “failure injection.” As the name suggests, fault injection is a technique for deliberately introducing stress or failure into a system in order to see how the system responds. But what exactly does this mean, and how does this relate to Chaos Engineering?

Read Post

What is Chaos Engineering and How to Implement It

Feb 9, 2021 By Alex Mair In Coralogix

Chaos Engineering is one of the hottest new approaches in DevOps. Netflix pioneered it back in 2008, and since then it’s been adopted by thousands of companies, from the biggest names in tech to small software companies. In our age of highly distributed cloud-based systems, Chaos Engineering promotes resilient system architectures by applying scientific principles. In this article, I’ll explain exactly what Chaos Engineering is and how you can make it work for your team.

Read Post