Latest posts

Upskilling your Network Operations Center

Many organizations are investing heavily in AI and automation to remove the burden of manual work and improve operational efficiency. However, to drive wide-scale adoption, they also need employees who can collaborate effectively with the technology. To bridge that gap, companies can use upskilling to retain talent, mitigate risks to the business, and allow employees to grow their careers.

Whose infrastructure is it anyway?

The recent McKinsey report, The State of Cloud Computing in Europe, has exposed not only low returns but also serious challenges for businesses embracing cloud as the basis of digital transformation. The first concern is that the value of cloud is not only ‘in isolated pockets and at subscale’ but also limited to the IT department. Whilst 75 percent of those surveyed reported either technology cost savings or productivity increases, only one-third have seen such savings beyond IT.

Why "why" is the wrong question to be asking after incidents with Dennis Henry of Okta

In last week’s episode of The Debrief, we had on Colette Alexander, Director of Engineering at HashiCorp, to discuss some of the myths around incident response. One of the myths we spoke about in that conversation was the idea that asking “why” is better than asking “how.” In reality, asking “how” allows you to focus more on the contributing factors that led to an incident, whereas “why” tends to single out a person, which can lead to a lot of blame.

EBS Pricing Explained: A Guide For 2024

Understanding Amazon Elastic Block Store (EBS) pricing is fundamental for any organization that uses AWS and wants to manage its cloud costs effectively. Amazon EBS provides the storage your cloud applications need to run smoothly. However, it’s equally important to understand its pricing to keep your cloud spending in check. This guide aims to simplify Amazon EBS pricing and offers practical tips on managing and reducing these costs. But first, what is it?

#026 - Kubernetes for Humans Podcast with BJ Badyk (Nexxen)

BJ Badyk is a human who desires an easier life. Nerd from birth, his curiosity led him down a path through the start of ISPs, Silicon Valley during the dot-com bubble, the last few years of the Playboy brand, and into the world of Adtech. He currently runs the platform engineering team at Nexxen, where they work on unique ways of handling millions of requests per second with Kubernetes. The team was an early adopter of Talos Linux, which they now run at scale. He presented at TalosCon 2023 and continues to pursue simple solutions to complex problems.

Recurring 'Service Restart' Remediation with Resolve Actions

In this demonstration, we break down Resolve's incident automation, which helps identify recurring IT issues by searching for previous incidents within a specific timeframe. If multiple incidents are detected, the automation flags the issue as chronic, updates the incident, and assigns it for further investigation to prevent endless retries. This speeds up the resolution of recurring IT issues.
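
The recurrence check described above can be sketched in a few lines of Python. This is only an illustration of the general idea, not Resolve's actual implementation or API: the `Incident` class, the `triage_restart_incident` function, the seven-day window, the threshold of two prior incidents, and the `problem-management` assignee are all hypothetical names and values chosen for the example.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta
from typing import List, Optional


@dataclass
class Incident:
    """Hypothetical incident record; field names are assumptions."""
    id: str
    service: str
    opened_at: datetime
    tags: List[str] = field(default_factory=list)
    assignee: Optional[str] = None


def triage_restart_incident(
    current: Incident,
    history: List[Incident],
    window: timedelta = timedelta(days=7),  # assumed lookback period
    threshold: int = 2,                     # assumed number of prior incidents
) -> str:
    """Decide whether to restart the service again or escalate.

    Returns "escalate" when enough earlier incidents for the same
    service fall inside the lookback window, otherwise "restart".
    """
    cutoff = current.opened_at - window
    previous = [
        inc for inc in history
        if inc.id != current.id
        and inc.service == current.service
        and cutoff <= inc.opened_at < current.opened_at
    ]
    if len(previous) >= threshold:
        # Flag as chronic and hand off instead of retrying the restart.
        current.tags.append("chronic")
        current.assignee = "problem-management"  # hypothetical queue name
        return "escalate"
    return "restart"


if __name__ == "__main__":
    now = datetime(2024, 1, 10, 12, 0)
    past = [
        Incident("INC-1", "billing-api", now - timedelta(days=1)),
        Incident("INC-2", "billing-api", now - timedelta(days=3)),
    ]
    new_incident = Incident("INC-3", "billing-api", now)
    print(triage_restart_incident(new_incident, past), new_incident.tags)
```

Matching on the service name alone is a simplification; a real automation would more likely key on the configuration item or alert signature, and would update the ticket in the ITSM tool rather than mutate an in-memory object.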