The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Time Series Forecasting Use Cases and Anomaly Detection

Wouldn’t it be great to peek into the future and find answers to the problems that you’re facing today? This may sound like science fiction, but many companies already have this capability, and they are building strategies around it to strengthen their monitoring and analytics. One way to do this is time series forecasting, a statistical method. Techniques like anomaly detection let you take advantage of the insights that time series forecasting provides.

Using the Density Function for Adaptive Thresholding with Splunk

It’s 3 PM on a Friday, and your day is winding down. Suddenly, you get an urgent email from your boss asking you to set up an alert for monitoring volume. You consider this an easy task. You set a hard threshold for what you think counts as low volume, based on the last four hours of incoming data.

Retrace Power User Tips and Tricks - Extending APM

Retrace is a full-lifecycle APM solution that includes tools and capabilities far beyond your typical APM tool. With sophisticated log management, detailed code tracing, deployment tracking and more, Retrace delivers what your DevOps team needs most to resolve issues before they impact users. By extending usability beyond traditional APM functionality, Retrace provides greater value than competitive products. But where would Retrace be without robust APM functionality?

How to monitor and troubleshoot Fluentd with Prometheus

Fluentd is an open source data collector widely used for log aggregation in Kubernetes. Monitoring and troubleshooting Fluentd with Prometheus helps you identify potential issues affecting your logging and monitoring systems. In this article, you’ll learn how to start monitoring Fluentd with Prometheus, following the monitoring recommendations in the Fluentd docs. You’ll also discover the most common Fluentd issues and how to troubleshoot them.

Anomaly rate in every chart

A month ago, we introduced unsupervised ML & Anomaly Detection in Netdata, the Anomaly Advisor. Today, we’re happy to announce that we’re bringing anomaly rates to every chart in Netdata Cloud. Anomaly information is no longer limited to the Anomalies tab and will be accessible to you from the Overview and Single Node View tabs as well. This will make your troubleshooting journey easier, as you will have the anomaly rates for any metric available with a single click.

7 Essential Tips For Choosing The Best Domain Name and Why it Matters

Choosing a domain name is a task that requires your full attention and should be thought about long before you work through your website launch checklist. An inappropriate or poorly thought-through domain name is something you'll be stuck with for the foreseeable future. Domain names are tricky to change, and changing one can lead to serious SEO complications for your brand. By doing your research, you'll be better placed to choose a domain that works for you by maximizing website traffic and driving business revenue.

How to send logs to Grafana Loki with the OpenTelemetry Collector using Fluent Forward and Filelog receivers

In this guide, we’ll set up an OpenTelemetry Collector that collects logs and sends them to Grafana Loki running in Grafana Cloud. We will consider two examples of sending logs to Loki via the OpenTelemetry Collector. The first one shows how to collect container logs with a Fluent Forward receiver. The second one shows how to collect system logs with a Filelog receiver.

Scaling Engineering Teams: Perspective from our VP of Engineering

At Catchpoint, my role can be summarized at a high level as two halves: designing and taking care of Engineering teams – and working with those teams to design and take care of the various distributed systems that run our platform. I recently attended Sapphire Ventures' Hypergrowth Engineering Summit (thank you David Carter and Sapphire for the invitation!), where the sessions focused on creating and scaling high-functioning engineering.