Operations | Monitoring | ITSM | DevOps | Cloud

January 2022

DevOps Tools (All of the Tools Your Team Needs)

Wondering about DevOps Tools? We explain the best tools for every step of the DevOps development process. What are DevOps Tools used for? DevOps relies on effective tools to help teams manage the entire software development lifecycle. These tools can automate tasks, monitor applications, and facilitate sharing of information between teams.

DevOps Methodology | Goals, Principles & Process

Wondering what DevOps Methodology is all about? We will explain what it is, how it works, and the principles and processes that make it successful. What is DevOps Methodology? DevOps methodology is a development process where Development and IT Operations collaborate throughout the lifecycle to facilitate faster deployment of reliable software products.

Introducing CommsFlow for Context-Rich and Timely Updates to All Stakeholders

We’re so excited to announce our latest platform feature, CommsFlow™! This addition to the core Blameless product offering allows teams to keep stakeholders updated as the reliability of services and applications change. With our new automated and customizable communication flows, on-call, engineering, and business teams feel a sense of accomplishment and, of course, stay informed.

Canary Deployments | The Benefits of an Iterative Approach

At Blameless, we want to embrace all the benefits of the SRE best practices we preach. We’re proud to announce that we’ve started using a new system of feature flagging with canaried and iterative rollouts. This is a system where new releases are broken down and flagged based on the features each part of the release implements. Then, an increasing subset of users are given access to an increasing number of features.

SRE and the Practice of Practice

Part of the trepidation of being on-call is encountering unfamiliar emergency scenarios where we are surprised by suddenly not knowing how to do our jobs. We feel lost and alone, complicated by the world around us, powerless to resolve or even mitigate the problem. On-call need not be a solo affair full of fear and anxiety. There are ways we can employ practice and open collaboration outside of incidents to prepare us better.

The Universal Language: Reliability for Non-Engineering Teams

We talk about reliability a lot from the context of software engineering. We ask questions about service availability, or how important it is for specific users. But when organizations face outages, it becomes immediately obvious that the reliability of an online service or application is something that impacts the entire business with significant costs. A mindset of putting reliability first is a business imperative that all teams should share.

Building an SRE Team with Specialization

As organizations progress in their reliability journey, they may build a dedicated team of site reliability engineers. This team can be structured in two major ways: a distributed model, where SREs are embedded in each project team, providing guidance and support for that team; and a centralized model, where one team provides infrastructure and processes for the entire organization.