Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Part 5: Proactive Observability With AIOps- Level 4

Level 4, Proactive Observability With AIOps, is the most advanced level of observability. At this stage, artificial intelligence for IT operations (AIOps) is added to the mix. AIOps, in the context of monitoring and observability, is about applying AI and machine learning (ML) to sort through mountains of data looking for patterns.

Understanding the Observability Maturity Model

Based on research and conversations with enterprises from various industries, StackState created the Observability Maturity Model. This model defines the four stages of observability maturity. The ultimate destination is level four, Proactive Observability with AIOps. However, even moving from level one to two, or from level two to three, is a huge improvement in your ability to get essential insights into your IT environment.

Just How Bad is a Down, Slow, or Dysfunctional Website? It's Worse than You Think!

Have you ever watched a movie (*cough* Godfather III) and said to yourself: “wow, this is so incredibly bad — I don’t think this can get worse!” But then it does. Much, much worse. Well, having a down, slow, or dysfunctional website is similarly nightmarish — just when you think the reputation devastation is finally over, there’s more on the horizon. With apologies to Shakespeare: hell hath no fury like a customer scorned. Not convinced?

How to convert a mini-arcade machine into a Grafana dashboard display with Raspberry Pi

When COVID-19 hit, Yonatan Mevorach faced an unexpected challenge, which required an unexpected solution. The Infrastructure Team Lead at Wix, the popular website building platform, was accustomed to looking at multiple monitors on the walls of the software company’s offices in Tel Aviv, Israel. These monitors cycled through Grafana dashboards to help the team keep tabs on Wix’s many services.

Released: Better Uptime Integration

StatusGator has a wide a variety of use cases: from education to help desk to IT and managed services and DevOps, too. All corners of an organization depend on cloud services and StatusGator gives you visibility into the status of all of your vendors. We’ve heard over and over from our DevOps users that alerts and notifications for their teams are already centralized into a single incident management platform such as OpsGenie, PagerDuty, or FireHydrant.

How to monitor OpenShift with Sysdig Monitor

Monitoring Red Hat OpenShift brings up challenges compared to a vanilla Kubernetes distribution. Discover how Sysdig Monitor, and its exclusive features in OpenShift, will help you monitor and troubleshoot your issues fast and easily. OpenShift builds many out-of-the-box add-ons into its Kubernetes foundation. For example, the OpenShift API server, Controller Manager, Ingress, or Marketplace ecosystem. This creates a more complex environment that can cause you to struggle.

What can be learned from recent BGP hijacks targeting cryptocurrency services

On August 17, 2022, an attacker was able to steal approximately $235,000 in cryptocurrency by employing a BGP hijack against the Celer Bridge, a service which allows users to convert between cryptocurrencies. In this blog post, I discuss this and previous infrastructure attacks against cryptocurrency services. While these episodes revolve around the theft of cryptocurrency, the underlying attacks hold lessons for securing the BGP routing of any organization that conducts business on the internet.

Troubleshooting SaaS User Experiences with AppNeta from Broadcom Software

Many organizations are moving to SaaS-based hosting environments but how do you monitor the user experience of these apps when they no longer exist within the four walls of your data center? In this demo, we are going to reveal how network operations teams can gain true visibility into the user experience with AppNeta from Broadcom Software - even when they do not own any of the network infrastructure delivering that experience.

Key Observability Scaling Requirements for Your Next Game Launch: Part II

In Part I in our series outlining best practices for scaling observability, we reviewed the data analysis capabilities that can help engineers troubleshoot faster during high pressure situations during a game launch. Nobody wants lag time or crashes in their game launch. Similarly, no one wants terminated sessions or for your gamer customers to log off and play a competitor’s game.