Operations | Monitoring | ITSM | DevOps | Cloud

DevOps

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Resiliency As the Next Step in the DevOps Transformation

We’ve reached the point in the DevOps transformation where efficiency and automation are no longer the highest objectives. The next step is engineering past automation and towards fully autonomous, self-healing systems. If you aren’t conversing about building this type of resilience into your systems and applications, there’s never been a better time than now to start.

Essential tips for automating DevOps workflows

Implemented well, automation can be a powerful tool for accelerating and scaling DevOps processes to keep your team building and shipping code quickly. But knowing what and how to automate DevOps workflows can be challenging; every organization’s DevOps practices are unique, and there’s no one “right” way to approach automation. Let’s look at a few tips for approaching DevOps workflow automation to help your team move faster.

How To Handle Untaggable And Untagged Cloud Spend

Let’s imagine, for a moment, that we live in a perfect world. In that world, you could check your company’s cloud bills and financial reports and find cleanly organized categories of spending that help you instantly understand where your money is going and why. Your engineers would meticulously label every spend item with useful metadata tags so you can clearly see which costs have increased and which are most affecting your bottom line.

Strategies to Align AI Data Collection and Management with DevOps Practices

DevOps is characterized by the acceleration of processes to ensure continuous delivery without compromising high software quality. Balancing speed and quality is quite a challenging task, though. Data issues are among the most significant problems encountered by DevOps teams. These can be worse in the context of AI development, where massive amounts of data play a crucial role in machine learning.

SRE Signals: 3 Types of Metrics for Site Reliability Engineering

Site Reliability Engineering, or SRE, is a widely-used set of interdisciplinary practices that help increase the efficiency of software development. But, aside from that, its purpose is to create scalable, connected, reliable, communicated systems that keep providing better, more reliable results. SRE leads to more connected, efficient organizations that can build resilient, iterable, and scalable software. To do this, SRE engineers leverage their coding expertise.