6 Real-World Status Page Examples: And What You Can Learn From Them

A status page is the most effective way to stay in touch with your users and quickly inform them about any outages or ongoing maintenance. As explained in our previous article, status pages can offer many benefits such as cost savings and a reduced number of support tickets. Creating a status page can significantly improve your incident management and relationships with your customers.

15 Best (Free) Windows Utilities for SysAdmins

Running an organization's IT infrastructure is not easy, but having free Windows utilities can make all the difference. Whether you provide tech support to a school, run the server of a startup, or manage a complex IT infrastructure, the work of a SysAdmin can be highly challenging. We're talking about unforeseen issues that require immediate attention, balancing the needs of different people, and being responsible for the upkeep of multiple computers.

Cloud Providers Health Report - January 2023

Check out our January 2023 health report on the most popular cloud providers. We analyze the health of the cloud providers based on the number of outages and problems during the month. The data comes from the cloud providers themselves, via their status pages. We normalize it and use it to generate the report.
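To make the "normalize, then count" idea concrete, here is a minimal sketch of tallying incidents per provider once they have been normalized. The record fields (`provider`, `kind`) and the sample entries are hypothetical and chosen only for illustration; they are not any provider's status page schema, and this is not necessarily how the report itself is produced.

```python
from collections import Counter

# Hypothetical, already-normalized incident records. The field names and
# values are assumptions for illustration only.
incidents = [
    {"provider": "provider-a", "kind": "outage"},
    {"provider": "provider-a", "kind": "problem"},
    {"provider": "provider-a", "kind": "outage"},
    {"provider": "provider-b", "kind": "problem"},
]


def summarize(records):
    """Count incidents per provider, split by incident kind."""
    report = {}
    for record in records:
        counts = report.setdefault(record["provider"], Counter())
        counts[record["kind"]] += 1
    return report


report = summarize(incidents)
for provider, counts in sorted(report.items()):
    print(provider, dict(counts))
```

Keeping one counter per provider makes it easy to compare outages and lesser problems side by side for the month.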

Extending Netdata's anomaly detection training window

We have been busy under the hood of the Netdata agent, introducing new capabilities that let you extend the "training window" used by Netdata's native anomaly detection. This blog post discusses one of these improvements, which helps you reduce "false positives" by effectively extending the training window through the new (beautifully named) number of models per dimension configuration parameter.
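One way to picture why keeping several models per dimension can cut false positives is the toy sketch below. It assumes a simple z-score model and an "every model must agree" rule. Both are illustrative assumptions rather than Netdata's actual implementation, and all function names are hypothetical.

```python
from statistics import mean, pstdev


def train(window):
    """Fit a toy model (mean and standard deviation) on one training window."""
    return (mean(window), pstdev(window))


def is_anomalous(model, value, threshold=3.0):
    """Flag a value that sits more than `threshold` deviations from the mean."""
    mu, sigma = model
    if sigma == 0:
        return value != mu
    return abs(value - mu) / sigma > threshold


def consensus_anomaly(models, value):
    """Assumed rule: a value is anomalous only if every model flags it."""
    return all(is_anomalous(model, value) for model in models)


recent = [10, 11, 9, 10, 12, 10]  # quiet recent window
older = [10, 50, 11, 52, 9, 49]   # older window that already saw spikes
models = [train(recent), train(older)]

spike_seen_before = consensus_anomaly(models, 50)   # older model has seen ~50
truly_new_value = consensus_anomaly(models, 500)    # no model has seen this
print(spike_seen_before, truly_new_value)
```

With only the recent model, a value of 50 would be flagged. Adding the older model, which has already seen similar spikes, suppresses that alert while a genuinely new value is still flagged.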

Total experience: Today's top business multiplier

The pressures that converged upon businesses during the pandemic forced the rapid evolution of both the customer experience (CX) and employee experience (EX). In that make-it-or-go-under environment, many of those that survived came out ahead in terms of CX and EX. Now we face the next hurdle: strong macroeconomic headwinds. Inflation and the threat of a recession are forcing businesses to prove their resilience once again.

How Grafana Labs uses and contributes to OpenCost, the open source project for real-time cost monitoring in Kubernetes

While more and more teams are adopting Kubernetes as their standard container orchestration technology, cost insight is lacking. Teams often don’t know how much they’re spending, where in their organization they are spending, or what is driving their infrastructure cost increases. OpenCost helps alleviate this problem by bringing real-time cost monitoring to Kubernetes workloads with a solution that encompasses both an open specification and an open source project.

Infrastructure metrics expanded to longer time frames

Understanding your systems’ status is essential for ensuring the reliability and stability of your applications and services. Without full awareness of what’s going on within your infrastructure, it can be difficult to manage solvable issues and to achieve reachable goals. Besides, it wouldn’t make much sense to run an app or service such as an e-store while ignoring what’s actually happening with it. How can you make any decisions that way?

TL;DR InfluxDB Tech Tips: Downsampling with Flight SQL and AWS Lambda

This tutorial covers how to perform downsampling with the new InfluxDB storage engine, InfluxDB IOx, in InfluxDB Cloud (available on AWS us-east-1 and AWS eu-central-1 starting January 31st) using AWS Lambda. InfluxDB IOx addresses key user needs and is built on the Apache ecosystem (Apache Parquet, Apache DataFusion, Apache Arrow, and Apache Flight SQL).
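For readers new to the term, downsampling means replacing many raw points with fewer aggregated ones. The sketch below shows the idea in plain Python, averaging values into fixed time buckets. It deliberately does not use InfluxDB, Flight SQL, or AWS Lambda, and the function name and sample data are hypothetical.

```python
from collections import defaultdict


def downsample(points, interval_s):
    """Average (timestamp, value) points into fixed-width time buckets.

    Timestamps are integer seconds; each bucket is labeled by its start time.
    """
    buckets = defaultdict(list)
    for ts, value in points:
        bucket_start = ts - ts % interval_s
        buckets[bucket_start].append(value)
    return [
        (start, sum(values) / len(values))
        for start, values in sorted(buckets.items())
    ]


# Hypothetical raw samples: (seconds, reading).
points = [(0, 1.0), (20, 3.0), (40, 2.0), (60, 10.0), (90, 20.0)]
downsampled = downsample(points, 60)
print(downsampled)  # [(0, 2.0), (60, 15.0)]
```

In a real pipeline the same aggregation would typically be expressed as a query against the database and run on a schedule, which is the role a function such as AWS Lambda plays in the tutorial.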

Autocatalytic Adoption: Harnessing Patterns to Promote Honeycomb in Your Organization

When an organization signs up for Honeycomb at the Enterprise account level, part of their support package is an assigned Technical Customer Success Manager. As one of these TCSMs, part of my responsibilities is helping a central observability team develop a strategy to help their colleagues learn how to make use of the product.