September 2021

Reliability is not an engineering metric

Sep 30, 2021 By Robert Ross In FireHydrant

If you're an engineer reading this, you might be wondering what I mean by the title. You might be a Site Reliability Engineer whose primary responsibility is to maintain the reliability of your company’s product/solution. You might be a software builder, a programmer responsible for building new capabilities and shipping them to production. All of these are important for any business to remain competitive.

Read Post

FireHydrant

Read more about Reliability is not an engineering metric

SRE Back-to-School Checklist

Sep 23, 2021 By Emily Arnott In Blameless

Whether it's in classrooms or on Zoom calls, the kids have headed back to school! Bright-eyed students are gearing up to study new subjects and test their brains. Hopefully on their report cards, failure isn’t inevitable. Before the first day, parents load up their kids’ backpacks with everything they’ll need. Being well equipped with good supplies is the best way to stay focused and educate “reliably”. Likewise, SREs need the right tools and practices for the job.

Read Post

Blameless

Read more about SRE Back-to-School Checklist

Modern SRE Practices for Incident Management

Sep 23, 2021 By Mary Chen In VMware Tanzu

At VMware, we make use of modern development and site reliability engineering (SRE) practices on a regular basis. And those of us who work on the VMware Tanzu Observability product marketing team regularly get exposure to various SRE teams that implement modern practices with the observability technology we create.

Read Post

VMware Tanzu

Read more about Modern SRE Practices for Incident Management

What is expected in the SRE role? We analyzed 30 job postings to find out.

Sep 21, 2021 By Pruthvi In Spike

In 2016, Google released the definitive book on Site Reliability Engineering (SRE) - a practice that had originated in the company to take care of a monumental problem - how to keep the Google services running with high reliability. Over the years, SRE has been widely adopted by dev teams across the globe and is a popular role at startups and enterprises alike. Here is a look at how search for SRE has trended over the years.

Read Post

Spike

Read more about What is expected in the SRE role? We analyzed 30 job postings to find out.

A Migration That Paid Tech Dividends

Sep 20, 2021 By Darrell Pappa In Blameless

TL;DR: Old, deprecated code/infrastructure is a challenge that every engineer will come across. Remedy what you can and remember that some extra effort can go a long way. It can uncover issues that, when addressed, will save you in the future. Part of the challenge of software development is maintaining legacy code and infrastructure. When you ignore or neglect these, issues start to pop up and your reliability suffers, causing pain for your customers. The trick here is to actively steward each project.

Read Post

Blameless

Read more about A Migration That Paid Tech Dividends

SRE vs. DevOps: What are the Differences?

Sep 19, 2021 By Mateus Gurgel In Rootly

SRE and DevOps are closely related concepts, and many businesses can benefit from embracing both of them. Nonetheless, there are important distinctions between SRE and DevOps.

Read Post

Rootly

Read more about SRE vs. DevOps: What are the Differences?

DevOps Lifecycle: Step-By-Step Walkthrough & Best Practices

Sep 14, 2021 By Emily Arnott In Blameless

Want to know more about the DevOps lifecycle? We explain the seven phases in DevOps, and how each one plays a vital role in the development process.

Read Post

Blameless

Read more about DevOps Lifecycle: Step-By-Step Walkthrough & Best Practices

Going from Zero to SRE

Sep 14, 2021 By Ricardo Castro In Squadcast

Establishing a formal SRE practice can be either a 'nice-to-have' or a 'must-have' depending on org size, and team structure among other important factors. In this blog, Ricardo Castro shares his thoughts on the key SRE principles that every organization must incorporate and when they should incorporate in their SRE journey.

Read Post

Squadcast

Read more about Going from Zero to SRE

What is an SRE?

Sep 9, 2021 By JJ Tang In Rootly

A comprehensive definition of SREs and Site Reliability Engineering, including what SREs do and what makes SREs different from other roles.

Read Post

Rootly

Read more about What is an SRE?

DevOps vs. Agile

Sep 8, 2021 By Emily Arnott In Blameless

Curious about the differences between DevOps vs. Agile development methodologies? We'll explore and compare both approaches. What are the key differences between DevOps vs. Agile? Agile and DevOps are methodologies that share the goal of producing software quickly. In DevOps, Development and Operations work together closely throughout the software development lifecycle process. Agile is an iterative approach that focuses on deploying releases rapidly with small teams.

Read Post

Blameless

Read more about DevOps vs. Agile

How Lowe's SRE reduced its mean time to recovery (MTTR) by over 80 percent

Sep 7, 2021 By Shyam Palani In Google Operations

The stakes of managing Lowes.com have never been higher, and that means spotting, troubleshooting and recovering from incidents as quickly as possible, so that customers can continue to do business on our site. To do that, it’s crucial to have solid incident engineering practices in place. Resolving an incident means mitigating the impact and/or restoring the service to its previous condition.

Read Post

Google Operations

Read more about How Lowe's SRE reduced its mean time to recovery (MTTR) by over 80 percent

The Role of SREs in Observability

Sep 3, 2021 By Quentin Rousseau In Rootly

Although conversation about observability often ignores SREs, SREs have a central role to play in observability success.

Read Post

Rootly

Read more about The Role of SREs in Observability

Essential Tools for Site Reliability Engineers

Sep 2, 2021 By Ritika Bramhe In OnPage

Site reliability engineers (SREs) are involved in scaling systems and making them reliable and efficient for organizations. But SREs often fail to build system resiliency when they do not have the right tools at their disposal. In this post, we’ll uncover five leading tools that SREs can use to drive the reliability and stability of computing systems. It also examines how SREs can use the tools to improve operations tasks and infrastructure processes.

Read Post

OnPage

Read more about Essential Tools for Site Reliability Engineers

Container Orchestration Explained Simply

Sep 1, 2021 By Emily Arnott In Blameless

Wondering about container orchestration? When your organization manages too many containers, you start to need container orchestration. We'll explain. What is container orchestration? Container orchestration is the automation of many operational tasks in your container-based applications.

Read Post

Blameless

Read more about Container Orchestration Explained Simply

Operations | Monitoring | ITSM | DevOps | Cloud

September 2021

Reliability is not an engineering metric

SRE Back-to-School Checklist

Modern SRE Practices for Incident Management

What is expected in the SRE role? We analyzed 30 job postings to find out.

A Migration That Paid Tech Dividends

SRE vs. DevOps: What are the Differences?

DevOps Lifecycle: Step-By-Step Walkthrough & Best Practices

Going from Zero to SRE

What is an SRE?

DevOps vs. Agile

How Lowe's SRE reduced its mean time to recovery (MTTR) by over 80 percent

The Role of SREs in Observability

Essential Tools for Site Reliability Engineers

Container Orchestration Explained Simply

Monthly Archive

Follow Us