The latest news and information on Site Reliability Engineering and related technologies.

Guide to Service Level Indicators and Setting Service Level Objectives

Nov 8, 2022 By Last9 In Last9

A guide to setting practical Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for your Site Reliability Engineering practice.

Introducing a more complete logs forwarding experience

Nov 7, 2022 By Prineet Kaur Bhurji In Upsun

One of the key attributes of DevOps and SRE engineers is their ability to meticulously observe and monitor all of their applications. This task becomes more efficient when all generated logs are forwarded to a single endpoint. With centralized logging, engineers can get an accurate overview, at any time and from one place, of every event that takes place across their applications. Storing logs in an external system also helps companies meet the requirements of many compliance certifications.

Why 'owning Services' is critical for effective Incident Response

Oct 31, 2022 By Vardhan NS In Squadcast

There is a famous quote: ‘For every minute spent organizing, an hour is earned.’ In the world of incident response, at least, nothing could be more apt. Digital infrastructure today is made up of many services, and an outage can result from one impacted service or from several. It is therefore essential to keep a catalog of all services, along with the point of contact (the service owner) responsible for maintaining each one.

On Building a Platform Team

Oct 31, 2022 By Jess Mink In Honeycomb

It may surprise you to hear, but Honeycomb doesn’t currently have a platform team. We have a platform org, and my title is Director of Platform Engineering. We have engineers doing platform work. We even have an SRE team and a core services team. But a platform team? Nope. I’ve been thinking about what it might mean to build a platform team from scratch (a situation some of you may also be in), and it led me to ask some crucial questions. What should such a team own?

Routing alerts from AWS Elastic Beanstalk via CloudWatch

Oct 27, 2022 By Vishal Padghan In Squadcast

Amazon Web Services (AWS) offers 100+ services, each focusing on a specific area of functionality. However, it can be challenging to pick the right services for a task and to provision them. AWS Elastic Beanstalk lets you easily deploy and manage applications without having to learn about the underlying infrastructure that runs them.

Introduction to Automation Testing Strategies For Microservices

Oct 25, 2022 By Rajiv Srivastava In Squadcast

Microservices are distributed applications that may be deployed in different environments, written in different programming languages, and backed by different databases, with many internal and external communications between them. A microservice architecture depends on multiple interdependent applications for its end-to-end functionality. This complexity calls for a systematic testing strategy that ensures end-to-end (E2E) coverage of any given use case. In this blog, we will discuss some of the most widely adopted automation testing strategies for microservices, using the testing triangle approach.

Authors' Cut-Gear up! Exploring the Broader Observability Ecosystem of Cloud-Native, DevOps, and SRE

Oct 13, 2022 By Liz Fong-Jones In Honeycomb

You know that old adage about not seeing the forest for the trees? In our Authors’ Cut series, we’ve been looking at the trees that make up the observability forest—among them, CI/CD pipelines, Service Level Objectives, and the Core Analysis Loop. Today, I'd like to step back and take a look at how observability fits into the broader technical and cultural shifts in technology: cloud-native, DevOps, and SRE.

SRE Fundamentals: Everything you need to know

Oct 13, 2022 By Cortex In Cortex

Google has had an outsized impact on the world, from its unrivaled search engine to its expansion into a range of customer-focused services. It would be difficult to make an impact of this magnitude without also leading the way in the software development industry. One of its biggest contributions to the community is a set of principles known as site reliability engineering or SRE.

Setting better SLOs using Google's Golden Signals

Oct 11, 2022 By Andre Newman In Gremlin

To many engineers, the idea that you can accurately and comprehensively track your application's user experience using just a few simple metrics might sound far-fetched. Believe it or not, there are four metrics that aim to do just that. They're called the four Golden Signals and should be a core part of your observability and reliability practices.

How Many SREs Does Your Company Need? Here's How to Decide

Oct 9, 2022 By JJ Tang In Rootly

So you’ve decided to take advantage of Site Reliability Engineering by hiring SREs for your company. Now, you have a second decision to make: Exactly how many SREs to hire. Do you need just one or two SREs? Or should you build a sprawling SRE team, with a dozen or more SREs on hand to support your organization’s reliability needs? The answers to these questions will, of course, vary; every business’s needs are different.