Jun 23, 2022   |  By Shyam Sreevalsan
A month ago, we introduced unsupervised ML & Anomaly Detection in Netdata, the Anomaly Advisor. Today, we’re happy to announce that we’re bringing anomaly rates to every chart in Netdata Cloud. Anomaly information is no longer limited to the Anomalies tab and will be accessible to you from the Overview and Single Node View tabs as well. This will make your troubleshooting journey easier, as you will have the anomaly rates for any metric available with a single click.
Jun 15, 2022   |  By Andrew Maguire
As of v1.35.0 the Netdata Agent can now run Metric Correlations (MC) itself. This means that, for nodes with MC enabled, the Metric Correlations feature just got a whole lot faster! The Netdata Metric Correlations feature uses a Two Sample Kolmogorov-Smirnov test to look for which metrics have a significant distributional change around a highlighted window of interest.
Jun 7, 2022   |  By Christopher Boydstun
Sometimes a hat is just a hat, the truth is just the truth, and the clearly most popular example of a category is plain to see. In this case, Ubuntu is the most popular Linux distribution currently available. With the operating system’s superior popularity also comes an amazing amount of community support.
Jun 7, 2022   |  By Shyam Sreevalsan
Netdata’s new Anomaly Advisor feature lets you quickly identify potentially anomalous metrics during a particular timeline of interest. This results in considerably speeding up your troubleshooting workflow and saving valuable time when faced with an outage or issue you are trying to root cause.
May 20, 2022   |  By Shyam Sreevalsan
Imagine this: You are an engineer at a startup. You are responsible for keeping all the applications running smoothly and safely in production. At first, you have things under control, but soon enough things start getting more complex.
May 18, 2022   |  By Andrew Maguire
Today we are excited to launch one of our flagship ML assisted troubleshooting features in Netdata – the Anomaly Advisor. The Anomaly Advisor builds on earlier work to introduce unsupervised anomaly detection capabilities into the Netdata Agent from v1.32.0 onwards.
May 3, 2022   |  By Costa Tsaousis
In the Kubernetes (K8s) community, there is a huge misconception about CPU allocation and utilization. Even highly experienced SREs find themselves struggling with the way Kubernetes allocates CPU resources, leading to misconfigured CPU allocations and extremely negative outcomes. For starters, this results in significant quality degradation on important service components, introduced by behind-the-scenes CPU limiting (or throttling).
Apr 28, 2022   |  By Tasos Katsoulas
At Netdata, we love two things more than anything else: Our goal is to make troubleshooting and monitoring as seamless as possible with the open-source Agent. This includes giving you pre-configured alerts so that you get notified immediately when a disruption occurs. The Netdata Agent comes with over 250 pre-configured and optimized alerts.
Apr 27, 2022   |  By Andrew Maguire
Our Analytics & ML lead Andrew Maguire recently had a chance to share our new Anomaly Advisor feature with the wider CNCF community. In his demonstration he did some light chaos engineering (using Gremlin and stress-ng) to generate some real anomalies on his infrastructure and watch how it all played out in the Anomaly Advisor in Netdata Cloud. There were also some great questions and discussion from the audience around ML in general and in the observability space itself.
Apr 4, 2022   |  By Tina Luedtke
Together with you, our fabulous community, Netdata is changing the way the world thinks of high fidelity monitoring – and we are gaining momentum. Our chief troublemaker and CEO, Costa Tsaousis, is the pioneer and architect of this revolution that’s brewing in the monitoring and troubleshooting space. Watch him explain the Netdata way of troubleshooting.
Jun 14, 2022   |  By netdata
The latest Netdata Agent release v1.35 introduces massive improvements for the machine learning-powered Anomaly Advisor, Metric Correlations, Kubernetes monitoring, and much more. Anomaly Advisor & on-device Machine Learning This release features a launch of the flagship machine learning (ML) assisted troubleshooting Anomaly Advisor. Unsupervised ML models are trained for every metric, at the edge, on your devices, enabling real-time anomaly detection across all your systems and applications.
Jun 6, 2022   |  By netdata
OK, it took our video guy exactly 42 seconds, but we're allowing for some leeway here. Basically, if you want to monitor a Linux node, whether physical, virtual, container, or IoT, all you need to do is.
Jun 1, 2022   |  By netdata
In this video, our Analytics & ML Lead, Andrew Maguire, walks through an example case study using the K6 load testing platform to run a load test against some of our demo servers running Netdata. Watch in real-time as the Anomaly Advisor reacts to the load test and painlessly surfaces the most anomalous metrics, making it easy to just "see" the load test and how it plays out on the servers.
May 30, 2022   |  By netdata
This video livestream meetup by Netdata takes a deep dive into the fundamentals of Machine Learning in DevOps Infrastructure Monitoring. It also covers the Netdata way of approaching Machine Learning. The Anomaly Advisor major update to Netdata is introduced as a valuable troubleshooting tool for any DevOps or Site Reliability Engineer looking for anomalies in their infrastructure. The hosts share real-world infrastructure monitoring & troubleshooting examples, as well as early feedback from the community on the Anomaly Advisor.
May 26, 2022   |  By netdata
Netdata's Lead Machine Learning Engineer, Andrew Maguire, walks through how to configure the all-new Anomaly Advisor. This new feature lets you troubleshoot in real-time, at scale, by identifying periods of time with raised anomaly rates across your entire infrastructure. In this guided video, Andrew will explain how to enable Netdata's ML functionality then, how to set up unsupervised anomaly detection with minimal configuration, and lastly how the Anomaly Advisor works to speed up troubleshooting when an incident occurs.
May 25, 2022   |  By netdata
Troubleshoot at scale with our all-new, lightweight Anomaly Advisor, powered by machine learning. The Anomaly Advisor finds periods of time with elevated anomaly rates across your entire infrastructure faster than ever before. This new feature works along with our ML unsupervised models on the edge, making your troubleshooting trouble-free! Even better, the Anomaly Advisor requires minimal configuration and is extremely lightweight. No need to worry about exhausting your CPU usage.
May 20, 2022   |  By netdata
Kubernetes monitoring can be confusing, but we're here to help. With Netdata, you can have high-fidelity monitoring of Kubernetes in a process so easy that we can explain it in just two minutes.
May 18, 2022   |  By netdata
A short walkthrough of the Anomaly Advisor feature in Netdata.
May 4, 2022   |  By netdata
Kubernetes has a bad habit of throttling CPU resources—with the result that you can suffer severely degraded performance or find yourself paying a fortune for extra, unnecessary infrastructure. Watch this video to learn how K8s clusters protect themselves from what they see as heavy CPU usage, and how you can monitor and troubleshoot the problem. We demonstrate how you can:– Use Netdata to reduce API response times by a factor of 7– Expect to reduce infrastructure resource requirements by 60-75%
Apr 28, 2022   |  By netdata
Your trouble is now our trouble, and we have set up a forum where we can all work together to find the best solutions to our countless out-of-the-box troubleshooting alerts, so you're no longer by yourself when the 💩hits the fan.

netdata is a system for distributed real-time performance and health monitoring. It provides unparalleled insights, in real-time, of everything happening on the system it runs (including applications such as web and database servers), using modern interactive web dashboards.

netdata is fast and efficient, designed to permanently run on all systems (physical & virtual servers, containers, IoT devices), without disrupting their core function.