Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

What Is Site Reliability Engineering? Understanding the complexities of this crucial function

Site reliability engineers manage a lot, and often in incredibly high-stakes environments. Remember that scene from "The Matrix" where Neo dodges bullets in slow motion? Of course you do. As an SRE, it can feel like you're the person getting hit by those bullets, frantically trying to investigate performance issues, automate away toil, and support the engineers around you, all before the next wave of attacks.

IT Asset Management Software in 2024

In today’s fast-paced technological landscape, organizations heavily rely on IT assets to drive their operations and deliver value to customers. Efficient and effective management of these assets is crucial for ensuring optimal performance, cost savings, and regulatory compliance. This is where IT Asset Management (ITAM) software plays a vital role.

What is Business Continuity and Disaster Recovery (BCDR)?

Perhaps the worst IT scenario an organization can face is an unexpected and forced suspension of all its operations. The downtime that’s experienced in such a situation can lead to financial damages that far exceed those from lost data or hits to reputation. While cyberattacks vary in intensity and approach, downtime and catastrophic loss of data come in many more forms and are equally, if not more, difficult to avoid.

Discover, Learn, and Experience: The Qovery Playground is Now Open!

In the dynamic world of development and operations (DevOps), one thing is clear: there's always room for new, innovative platforms that make life easier for developers and platform engineers. And today, we're thrilled to introduce our latest contribution to this dynamic sphere – the Qovery Playground.

Celebrating Grafana 10: Top 10 Grafana features you need to know about

Since Grafana started 10 years ago, there have been more than 43,000 commits to the open source project. Grafana founder Torkel Ödegaard has made more than 7,600 of those commits, and he recently reflected on some personal favorites he’s worked on, ranging from early query builders to the latest navigation updates. Torkel isn’t the only one who has strong feelings.

How our engineering team uses Polish Parties to maintain quality at pace

It’s fair to say that delivering software faster has never been more relevant. But in doing so, it’s easy to let your bar for quality slip. Often, the guardrail to avoid this is to hire dedicated QA Engineers, whose sole job is to ensure your software works as it should and to spot any issues that arise. Seems sensible, right? Well, at incident.io, we take a different approach.

10 Key Application Performance Metrics & How to Measure Them

If you are trying to figure out how to measure the performance of your application, you are in the correct place. We spend a lot of time at Stackify thinking about application performance, especially about how to monitor and improve it. In this article, we cover some of our most important application performance metrics you should be tracking.

9 Foolproof Ways to Protect Your Employee Privacy

Cybercrime is one of the greatest threats to modern businesses, so naturally, cybersecurity is becoming a top priority among business leaders worldwide. It’s easy to see why. Cybercrime can result in significant losses for companies, particularly financially. In fact, the global cost of cybercrime is expected to rise to $23.84 trillion by 2027, up from $8.44 trillion in 2022. Businesses collect and store mountains of data daily, and are obligated to keep that data secure.

Unearthing Gold: Deriving Metrics from Logs with Mezmo Telemetry Pipeline

Logs are like gold ore. They have valuable nuggets of information, but those nuggets often come in a matrix of less helpful material. Extracting the gold from the ore is crucial because it is vital to unlocking insights and optimizing your system(s). Raw logs can be overwhelming, containing informational messages, debug statements, errors, etc. However, buried within this sea of data lies the key metrics you can use to understand your applications' performance, availability, and health.