Reliably

London, UK
2020
  |  By Ryan Green
The consequences of downtime and data breaches can be devastating to organizations, leading to substantial financial losses and irreparable damage to a business’s reputation. If last week's outage by the Bank of England is anything to go by, after losing trillions of £’s per day due to downtime, resilience shouldn’t just be an afterthought for organizations.
  |  By Aimee Pearcy
Reliability engineering focuses on the ability of systems to perform as it is intended to and function without failure in a specified environment, for the required time duration. Reliability engineering can be applied across the entire lifecycle of software development. It is designed to increase the dependability of a product by detecting potential reliability issues early in the software development cycle, and correcting causes of failure that do occur.
  |  By Mbaoma Mary
A code freeze means no code can be altered or modified during the frozen time, and developers will not make any additional changes. Developers can only modify the code in the event of critical flaws and to the extent required to correct those vital problems. Primarily developers observe a code freeze during the final phase of software development when the software product has reached the delivery state.
  |  By Aimee Pearcy
Software reliability can be defined as the probability of a failure-free operation of a computer system over a specified period, under a set of specific conditions. It is an important factor in determining software quality. Site reliability engineering (SRE) is a software approach to IT operations that helps organizations to improve the reliability of their systems.
  |  By Mboama Mary
Since Google coined the term, the role of an SRE has evolved as the industry has shifted toward large-scale distributed microservices. An SRE’s job is to determine how to make systems more reliable and resilient.
  |  By Catrin Haberfield
It can be a big can of worms, but tackling IT downtime can be the first step to major cost savings. Here’s everything you need to know about downtime but were too afraid to ask.
  |  By Catrin Haberfield
Is 99.999% uptime realistic? We cover why you should care, and how you can achieve it.
  |  By Mbaoma Mary
A system’s reliability is one of the most important things that engineers should care about. They ensure customers are kept happy and keep organizations profitable. Investing in reliable processes and tools to ensure systems are reliable can be critical to company success. Site Reliability platforms are popular choice when it comes to monitoring and observing software services as they help make responding to and solving application problems easier.
  |  By Aimee Pearcy
Alert fatigue occurs when people become desensitized to the overwhelming number of alerts they receive and are expected to respond to. Even though these alerts are typically easy to respond to, it is the sheer number of them that ultimately causes people to feel fatigued. The higher the number of alerts, the more likely it is that employees are likely to begin to ignore and potentially miss an important alert leading to bigger consequences.
  |  By Sylvain Hellegouarch
Improving team health within DevOps is vital for success in any engineering team. In this article, we’ll look at some of the ways that you can improve team health with Reliably so you can keep your developers happier, healthier and free from burnout.
  |  By Reliably
In this video, we're going to ask Reliably Assistant to create an experiment that takes down a full AWS AZ.#chaosengineering #aws #reliability.
  |  By Reliably
In this video we're asking Reliably Assistant to create an experiment that adds latency between two Kubernetes services.
  |  By Reliably
On October 26th 2023, we had the pleasure of receiving Manuel Castellin, a seasoned expert in chaos engineering and Terraform, who took us through two real-world examples demonstrating how to overcome the challenges of implementing chaos engineering when your infrastructure isn’t initially prepared for it and securely experiment on production systems. In the second part of the meetup, Sylvain Hellegouarch, Chaos Toolkit lead developer and Reliably CEO, showed a quick demo of how to use Reliably to build your experiments in a less code-centric and more visual way.
  |  By Reliably

#kubernetes #eks #chaosengineering

  |  By Reliably

#reliably #chaosengineering #honeycomb #slack #resilience

  |  By Reliably

#reliably #chaosengineering #github #githubactions #resilience
Reliably lets you run experiments not only from the Reliably cloud but from your own environment. This video will focus on running a chaos engineering experiment in GitHub.

  |  By Reliably

#reliably #chaosengineering #resilience #kubernetes #k8s
Reliably lets you run experiments not only from the Reliably cloud but from your own environment. This video will focus on running a chaos engineering experiment in a Kubernetes cluster.

  |  By Reliably

#reliably #chaosengineering #resilience
In this video, we'll show how you can use starters and the Reliably cloud to get started very quickly.

  |  By Reliably
Learn how to use an existing experiment to create a template that users in your organization will be able modify to create their own experiments in minutes.
  |  By Reliably
Experience everything great about Chaos Experiments with the added benefits of our reliability platform. See Reliably in action and discover how to proactively verify with Chaos engineering so you can anticipate impacts on your users and prioritize appropriate actions safely.

Reliably is here to assist engineering teams and organisations to get better at operating with greater predictability and less anxiety.

We've built Reliably so you can to start creating and using Objectives instantly. Let the CLI populate command automatically explore your infrastructure and suggest meaningful objectives, run the agent from your CI/CD, and you're ready to go.

Reliably puts the high level of intelligence and abstraction brought by objectives in the hands of those responsible for the reliability of systems and services: developers.

Engineer your reliability culture.