Fulton, MD, USA
May 27, 2020   |  By Heinrich Hartmann
Crash events are one of the more serious problems that can occur when operating a service. Crashing components often cause cascading failures and service outages. To reveal the magnitude of damage and help prevent future occurrences, visibility into crash events is critical. Unfortunately, debugging crashes is one of the more complicated endeavors. The state of a crashed process is often compromised and the process can’t be trusted to collect debugging information on its own.
May 20, 2020   |  By Bob Moul
In a recent post I talked about the strain being placed on IT Infrastructure with the current surge in demand for online services being driven by the COVID-19 pandemic. I talked about how this sudden migration to online has exposed weaknesses in, and in some cases a total lack of, adequate monitoring practices. Unfortunately, many online sites have experienced degradation of service, poor customer experiences, and even complete outages.
May 14, 2020   |  By Bob Moul
COVID-19 is impacting nearly every company around the world. While the pandemic is affecting companies in different ways and to different degrees, a commonality many are experiencing is that the coronavirus is forcing much of our daily commerce activity online. I wrote in a post recently that literally overnight we’ve had to find new ways of working, meeting, shopping, managing healthcare, and even staying entertained.
Apr 24, 2020   |  By Bob Moul
This week, we announced the availability of our Spring 2020 release. The highlight of the release is our Kubernetes monitoring solution, which provides health-based alerting and horizontal pod auto-scaling. Additional enhancements include cloud monitoring, GCP Marketplace availability, performance improvements, and a more comprehensive Terraform integration. Here’s some background on these latest capabilities.
Apr 17, 2020   |  By Heinrich Hartmann
Latency SLOs help us quantify the performance of an API endpoint over a period of time. A typical latency SLO reads as follows: The proportion of valid* requests served over the last 4 weeks that were slower than 100ms is less than 1%. *In this context, “valid” means that the request responded with a status code in the 200s.
Jan 3, 2020   |  By Circonus
Machine Data Intelligence enables what was previously impossible due to technology limitations. Now, enterprises have unprecedented access to business insights that are driving transformative results.
Dec 1, 2019   |  By Circonus
As the number of things we want to monitor grows and sensors proliferate our world, enterprises have the opportunity to gather and analyze vast amounts of machine-generated data that provide extraordinary business insights.
Dec 1, 2019   |  By Circonus
Theo Schlossnagle discusses the most important trends in DevOps.
Nov 1, 2019   |  By Circonus
Current monitoring systems weren't built for the new requirements of IT. How can you be sure that yours is?
Nov 1, 2019   |  By Circonus
The key to improving your service quality could lie in accurately measuring the metrics associated with your microservices.
May 20, 2019   |  By Circonus
Understand the behavior of your systems with the Circonus monitoring and analytics platform, powered by histograms.
May 8, 2018   |  By Circonus
In this talk, we take a look at both log-linear and cumulative histograms and how they provide advantages over storing data as quantiles, averages, and other histogram implementations, such as linear and fixed-bucket.
Apr 17, 2018   |  By Circonus
IRONdb, the scalable, fault-tolerant Time Series Database, now has a new Grafana Datasource Plugin.
Apr 17, 2018   |  By Circonus
IRONdb is a durable, fault-tolerant Time Series Database, now with a Grafana Datasource Plugin. IRONdb is highly scalable, enabling you to reduce your hardware infrastructure by 50%.
Apr 17, 2018   |  By Circonus
IRONdb is the scalable, durable TSDB, now with a Grafana Datasource plugin. Unlock the power of Grafana's heatmaps and histograms.