Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Using distributed tracing to identify bottlenecks in your app flows

As an engineer building a distributed application, every now and then I need to look for and analyze bottlenecks in our system. There can be several triggers for conducting a bottleneck analysis. In this blog post I’ll share how I’ve been using our own product, Helios, and the power of distributed tracing, to help pinpoint bottlenecks in our system and resolve them fast.

What You Need to Know About ITIL for Service Management

As the person on the front lines, you know that providing the best service possible can be what makes your ITSM organization succeed. Every day, you work to build the relationships that help your organization create value for end-users. However, when you have inefficient processes, you end up having to be the person responding to an upset user.

Business Continuity vs. Business Resilience: Comparing Strategies for Staying Resilient

If there is one thing organizations can take away from the past few years, it's that they are far more vulnerable than they previously realized. From pandemics to critical supply shortages to widespread data breaches and natural disasters, businesses that don’t have plans in place to handle and respond to emergencies are at tremendous risk. As leaders plan for inevitable crises and disruption, interest in business resilience and continuity grows.

Held for Ransom - Ransomware Detection & Response with Flowmon ADS

Flowmon Anomaly Detection System takes an AI-based approach to detecting and alerting on the presence of threat actors within your network, from the point of initial access all the way through to exploitation. Gaining visibility into a ransomware attack by mapping a threat actor's earliest movements within your network enables you to stop the attack in its infancy. Flowmon's forensic visibility has you covered with all of the evidence you will need to conduct your investigation following an attack attempt.

Global Health Institute Swiss TPH trusts in Icinga

We’re proud of our many customers and users around the globe that trust Icinga for critical IT infrastructure monitoring. That’s why we’re now showcasing some of these enterprises with their success stories. These are stories from companies or organizations just like yours, of any size and from different kinds of industries. Some of them are our long-standing customers; others have only recently benefited from migrating from another solution to Icinga.

Routing Strategies for Security and Observability Data: How to Make the Most of Your Data at Scale

Data routing is a crucial but complex task for companies of all sizes. Ensuring that the right data is sent to the right tools can be a time-consuming and difficult process, and when things go wrong, it can have costly consequences. This is why having a robust data routing strategy is essential for any organization.

3 Website Reliability Metrics Councils Should Be Measuring

Users have high expectations that council websites will be up and reliable. Councils are also required to adhere to the guidelines set out in the Service Standard to make their websites accessible and user-friendly. Alongside these challenges, councils are often underfunded and understaffed, which can leave council web management teams stretched. Here are three key metrics that councils should be measuring to improve website reliability.

Optimize Application Performance with Code Profiling

When monitoring your application performance or troubleshooting an issue in production, context is key. The more information you have available, the faster you can prevent or detect a user-impacting issue. Observability tools offer many different features, like code profiling, to help contextualize your data. In this post, I’ll discuss what code profiling is and show an example of how it works.

A Complete Guide to Google's Core Web Vitals and How to Optimize Them

The success of your website lies in how satisfied your users are with it. To help assess the quality of your user experience, Google uses various signals from a web page. The three Core Web Vitals are some of the most important ones. In this article, I’ll talk about what each Core Web Vital means and how to optimize it to deliver a better user experience.