Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

What's the hype about Machine Learning?

Can it help businesses? Machine learning is an inescapable buzzword for many in the operations sector. Even friends and colleagues tend to make us aware of a new ML tool that may or may not be useful. While there are many ML tools in the market, not all are suitable for every business. Some tools, when tested, struggle to solve basic, everyday use cases. Therefore, when evaluating ML tools, other deeper questions and issues do arise.

Google Cloud Managed Service for Prometheus

Welcome back to GKE Essentials! In this episode, Kaslin Fields explores a key element of your GKE observability: Google Cloud Managed Service for Prometheus. Watch to see how Google Cloud's fully managed multi-cloud solution for Prometheus lets you globally monitor and alert on your workloads without having to manually manage and operate Prometheus at scale.

7 types of Redis latency and how to fix it

Redis is designed to be fast. In most cases, it is. However, there are times when Redis may be slow, due to network issues, disk latency, or other factors. When this happens, it is important to be able to detect the slow down and investigate the cause. Latency is the maximum delay between the time a client issues a command and the time the reply to the command is received by the client. Redis has strict requirements on average and worst case latency.

How to monitor systemd service liveness

The life of a sysadmin or SRE is often difficult, but occasionally very simple things can make a huge difference. Basic monitoring of your systemd services is one of those simple things, which we sometimes overlook. The simplest question one would want to know is if the thing that’s supposed to be running is actually running at all. If you use systemd services, you can guarantee an answer to that question within minutes using Netdata.

Modern IT Infrastructure Management: Three Pillars for Success

Your IT Infrastructure team faces untenable demands on their time and resources as your organization increasingly relies on complex hybrid infrastructures and an ever-growing set of technologies and cloud-based services. You can actually limit visibility and slow triage as you add monitoring tools to cover this expansion, blocking critical insight into your environment at the IT service level. What’s really required to be successful in today’s IT infrastructure environment?

Introducing PrivateLink Support for Enterprise

Network topology can get very complicated in the cloud, especially when you’re sending data to external SaaS providers. You will likely need to configure gateways and firewalls and keep close tabs on those points of egress. However, if your infrastructure exists within AWS, there’s a much simpler way and that’s through an AWS PrivateLink endpoint.

Metrics in Minutes: Prometheus Metrics into Coralogix using Open Telemetry

In this video, we'll explore how to connect Prometheus to Coralogix, using the Open Telemetry collector as an abstraction layer. This deployment minimises 3rd party code in your system, and provides a completely open source path to integration, all while using industry standard, simple tooling.

AIOps: The Future of IT Operations Management in 2023

AIOps stands for artificial intelligence for IT Operations. AIOps, is a set of tools and algorithms that gather data from the entire IT environment, including different monitoring systems, log files and other IT data sources. It then analyzes and applies machine learning algorithms to determine the root cause of an incident. This means that instead of having to go through a long troubleshooting process by analyzing log files and manually looking for root causes, AIOps does it for you in minutes.

Top 3 Issue Alert Tips to Stop Noisy Notifications

Sentry Alerts ping you on Slack, Microsoft Teams, or Pager Duty when something goes needs your attention. However, too many alerts can turn your notification channel into an endless noise feed. I spoke with dozens of Sentry customers in the past 6 months, and something I heard over and over again was “Sentry can get noisy at times” and “There are days I can’t keep up with Sentry notifications because we get so many of them”. Does this sound familiar?