DevOps

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Monitoring-as-Code for Scaling Observability

As data volumes continue to grow and observability plays an ever-greater role in ensuring optimal website and application performance, responsibility for end-user experience is shifting left. This can create a messy situation in which hundreds of R&D members, from back-end engineers and front-end teams to DevOps and SREs, are all shipping data and creating their own dashboards and alerts.

Lessons learned from building our first AI product

Since the advent of ChatGPT, companies have been racing to build AI features into their products. Previously, if you wanted AI features you needed to hire a team of specialists to build machine learning models in-house. But now that OpenAI’s models are an API call away, the investment required to build shiny AI has never been lower. We were one of those companies. Here’s our journey to building our first AI feature, and some practical advice if you’ll be doing the same.

Unleashing the power of AI and automation for effective Cloud Cost Optimization in 2024

In the current dynamic business environment, cloud computing has emerged as the fundamental driver of innovation and scalability. As companies increasingly rely on the cloud for their business initiatives, achieving cloud cost optimization remains a significant hurdle.

Azure Resource Monitoring: Setting Up Key Metrics Made Simple!

Setting up Azure Monitor to oversee all essential metrics and points of interest across every single Azure resource in a solution can be a challenging task. It consumes considerable time, especially when dealing with individual Azure resources, and the effort multiplies when managing numerous Azure resources. This video demonstrates how Serverless360 alleviates some of these challenges, simplifying the process of setting up effective Azure monitoring.

Q&A: What IT Automation Best Practices Should You Know Right Now? - Part 2

With a limitless load of questions on IT automation and the industry’s biggest trends, Resolve’s “Ask Me Anything (AMA)” session went about tackling them in an all-new way. We threw out the preparation, we threw out the scripts, and we asked our community to submit the questions that matter most to them and their organizations. Part of our leadership team took the hot seat and provided answers in real time, sans dress rehearsal.

How to monitor Azure Automation Runbooks?

This video guides you through developing an Automation solution with Azure Automation accounts. The primary objective is to streamline the management and monitoring of these services for IT support operators, irrespective of their expertise in Azure. Mike demonstrates how you can add an automation runbook to a Business Application in Serverless360 to manage it alongside the other resources that make up your solution, and how you can democratize some of the support by extending it to IT support operators.

8 Strategies for Reducing Alert Fatigue

Site Reliability Engineers (SREs) and DevOps teams often deal with alert fatigue: so many alerts arrive that it becomes hard to keep up, which makes it tougher to respond quickly and adds extra stress on top of existing responsibilities. According to a study, 62% of participants noted that alert fatigue played a role in employee turnover, while 60% reported that it resulted in internal conflicts within their organization.

Unlock the Secrets of Machine Learning: A Beginner's Guide with Josh Mesout - Navigate Europe 23

Dive into the world of machine learning with Josh Mesout. This video is a great starting point for beginners, offering a practical approach to understanding and applying machine learning concepts. Follow along as Josh demonstrates setting up a machine learning environment on Civo and explores a PyTorch notebook for handwriting recognition. Whether you're coding along or just watching, this session is packed with useful tips and resources for your machine learning journey. Don't forget to check out our GitHub repository for additional materials and join the conversation in the comments!

Supercharged with AI

One of the most painful parts of incident management is keeping on top of the many things that happen when you’re right in the middle of an incident. From figuring out and communicating what’s happening, to ensuring you learn from previous incidents, and even capturing the right actions – incidents are hard, but they don’t need to be this hard.