History of SRE: Why Google Invented the SRE Role
A history of Site Reliability Engineering from its origins at Google in 2003 to the present.
The latest News and Information on Service Reliability Engineering and related technologies.
A history of Site Reliability Engineering from its origins at Google in 2003 to the present.
In one of our previous articles, we discussed what an SRE is, what they do, and some of the common responsibilities that a typical SRE may have, like supporting operations, dealing with trouble tickets and incident response, and general system monitoring and observability. In this article, we will take a deeper dive into the various SRE principles and guidelines that a site reliability engineer practices in their role.
Follow these steps to write a great SRE job resume.
The need for relevant and contextual telemetry data to support online services has grown in the last decade as businesses undergo digital transformation. These data are typically the difference between proactively remediating application performance issues or costly service downtime. Distributed tracing is a key capability for improving application performance and reliability, as noted in SRE best practices.
An explanation of the meaning of SLA, SLO and SLI, and how SREs should use each concept to manage reliability.