Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Common Nagios Errors and What to Do about Them

Nagios is an open-source monitoring system that has become indispensable for system administrators and DevOps teams across the world. However, like any other software, you’re bound to come across errors with Nagios. In this article, we’re going to take a look at some common errors and how to solve them, along with the pros and cons of Nagios, and why MetricFire is the perfect alternative for monitoring.

Blue Matador + Squadcast: Alert Routing Simplified

Blue Matador is the fastest, easiest way to set up AWS infrastructure monitoring, allowing small teams to fully monitor their cloud operations with no manual setup. If you use Blue Matador for your cloud monitoring requirements, you can integrate it with Squadcast, an end-to-end Incident Response tool, to route alerts from Blue Matador to the right users in Squadcast with ease.

How do you measure software security maturity?

Scorecards are a Cortex feature that allow you to understand how well your services are doing on the metrics you care about. Scorecards are customizable to your needs, however several are common to most organizations. In our previous post, we shared the top three scorecards that we recommend to Cortex customers. Security maturity is one of the first scorecards we recommend organizations create.

Jad Jebara on Reinventing DCIM: Optimizing Hybrid Infrastructures with Hyperview

In an exclusive Digitalisation World podcast, our CEO, Jad Jebara, delves deep into the ever-evolving hybrid infrastructure landscape. Join us as we explore how companies are strategically optimizing application performance and the infrastructure that fuels their digital ambitions.

NOC Success Like Never Before: Automation Strategies for All-new Incident Management

Network Operations might never be the same. But then again, why would anyone want it to be? The power of automation and orchestration can bring incredible value to the Network Operations Center (NOC), including the business-critical call to get proactive and ahead of the incidence response and management game. It’s more than a towering volume of events – it’s the complexities involved, too.

Why Real-Time Debugging Becomes Essential in Platform Engineering

Platform engineering has been one of the hottest keywords in the software community in recent years. As a natural extension of DevOps and the shift-left mentality it fosters, platform engineering is a subfield within software engineering that focuses on building and maintaining tools, workflows, and frameworks that allow developers to build and test their applications efficiently.

Don't just build a dashboard! A DORA cautionary tale

Don't just build a dashboard! A DORA cautionary tale. Software delivery success isn't just about dashboards and metrics. You also need to think about how to improve as an engineering team. The point of the DevOps Research Assessment (DORA) is improvement. Give Sleuth a try and see how we give teams actionable insights on how to improve, no-code automations to instantly ship improvements, and metrics to measure their impact — all in a way that both managers and developers love.

A detailed guide on Azure Reservations

Organizations that invest in cloud technologies like Microsoft Azure might notice that cloud costs can easily get out of control. When cloud services use the Pay-as-you-go payment model, small amounts must be paid each time a cloud service is used. However, when you have deployed hundreds of cloud resources, the total spending can end up with a much higher monthly bill than expected. By optimizing your cloud costs, you lower your monthly Azure bill and gain cost efficiency and predictability.