November 2021

How Sabre is using SRE to lead a successful digital transformation

Editor’s note: Today we hear from Kenny Kon, an SRE Director at Sabre. Kenny shares how Sabre has successfully adopted Google’s SRE framework by leveraging its partnership with Google Cloud. A leader in the travel industry, Sabre Corporation drives innovation in global travel and develops solutions that help airlines, hotels, and travel agencies transform the traveler experience and satisfy the ever-evolving needs of its customers.

SLA vs. SLO (Differences Explained)

Wondering about SLAs and SLOs? We explain service level agreements and service level objectives, their differences, and the importance of each. What are the major differences between service level agreements (SLAs) and service level objectives (SLOs)? An SLA is a legal agreement between the business and the customer that includes a reliability target and the consequences of failing to meet it. An SLO is an internal reliability target, usually stricter than the SLA, that the team sets for the service as customers experience it.
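To make the distinction concrete, here is a minimal sketch of checking one measured availability figure against both targets. The 99.9% SLO, the 99.5% SLA, the request counts, and the function names are all assumptions chosen for illustration; they do not come from the article.

```python
def availability(good: int, total: int) -> float:
    """Fraction of requests served successfully."""
    if total <= 0:
        raise ValueError("total must be positive")
    return good / total


def check_targets(good: int, total: int,
                  slo: float = 0.999, sla: float = 0.995) -> dict:
    """Compare measured availability with a hypothetical SLO and SLA.

    The SLO is the stricter internal target; the SLA is the looser
    contractual one. Both default values are illustrative assumptions.
    """
    measured = availability(good, total)
    failures = total - good
    # Error budget: the number of failures the SLO tolerates in this window.
    allowed_failures = (1 - slo) * total
    return {
        "availability": measured,
        "slo_met": measured >= slo,
        "sla_met": measured >= sla,
        # A value above 1.0 means the error budget is overspent.
        "error_budget_consumed": failures / allowed_failures,
    }


# Hypothetical window: 1,000,000 requests, 3,000 of which failed.
report = check_targets(good=997_000, total=1_000_000)
print(report)
```

In this made-up window the service misses its internal SLO while still meeting the SLA, which is the gap an SLO is meant to provide: the team gets a warning before any contractual consequence applies.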

DevOps Benefits & How to Maximize Them for Your Team

Curious about DevOps benefits? Whether you are just adopting DevOps or improving your current process, we explain the top benefits and how to maximize them. What are DevOps benefits? In DevOps, the operations and development teams work closely together during the entire software development lifecycle. This collaborative approach leads to many benefits.

How to Write Meaningful Retrospectives

One of the foundations of incident management in SRE practice is the incident retrospective. It documents all the learnings from an incident and serves as a checklist for follow-up actions. If we step back, there are 7 main elements to a retrospective. When done right, these elements help you better understand an incident, what it reveals about the system as a whole, and how to build lasting solutions.

SRE Principles: The 7 Fundamental Rules

In one of our previous articles, we discussed what an SRE is, what they do, and some of the common responsibilities that a typical SRE may have, like supporting operations, dealing with trouble tickets and incident response, and general system monitoring and observability. In this article, we will take a deeper dive into the various SRE principles and guidelines that a site reliability engineer practices in their role.

How to improve your influence as an SRE

Improving your influence within the company will help you deliver high-quality work, as your goals will be closely aligned with those of the company. In this blog piece, Ricardo explains how to improve your influence as an SRE. Balancing fast-paced business requirements with the demands of keeping production services stable is not an easy task.

Enabling SRE best practices: new contextual traces in Cloud Logging

The need for relevant and contextual telemetry data to support online services has grown in the last decade as businesses undergo digital transformation. These data are typically the difference between proactively remediating application performance issues and suffering costly service downtime. Distributed tracing is a key capability for improving application performance and reliability, as noted in SRE best practices.

A Practical Guide to Implementing SRE

This eBook gives you practical steps to implementing SRE practices in an organization that's already invested in DevOps. It outlines the clear benefits and lays out how they can be achieved. Three main topics are covered: Incident Management, Service Level Objectives (SLOs), and SRE Culture. Leveling up these critical aspects of SRE will reap both immediate and long-term benefits.

Podcast: Break Things on Purpose | Gustavo Franco, Senior Engineering Manager at VMware

In this episode, Jason is joined by Gustavo Franco, Senior Engineering Manager at VMware, to chat about chaos in Gustavo’s early days. Gustavo reflects on everything from Google’s early disaster recovery practices to the contemporary SRE movement.

How they SRE: Insights from the Cloudflare SRE team

Cloudflare is a global cloud services provider with offices around the world, from San Francisco, US to London, England to Sydney, Australia. Their mission, as stated front and center on their homepage, is to help build a better Internet. While that may read like hyperbole, their numbers are impressive: Cloudflare has over 126,000 paying customers, and 95% of Internet users in the developed world are within 50ms of their network.

DevOps Culture: How to Build a Stronger Team

Trying to improve your DevOps team? We’ll explain what DevOps culture is, how it benefits your team, and how you can build it within your organization. So what is DevOps culture? The main goals of DevOps culture are to increase collaboration and communication between teams, to give all participants a shared responsibility in the project, and to emphasize learning opportunities instead of spreading blame when things go wrong.