Operations | Monitoring | ITSM | DevOps | Cloud

The benefits and challenges of a single pane of glass

SCOM 2019 is a monitoring powerhouse. Its capabilities are unmatched. But it also has some serious issues when it comes to unearthing and visualizing the valuable data locked inside. The replacement of Silverlight with HTML5 in the SCOM 2019 web console was a welcome enhancement, but the SCOM web console still shares its design with the administration console, which is slow, complex, and makes it downright difficult to get the visibility you need.

Silencing Distractions with Review List and Automations

Responding to and ignoring notifications can be a full-contact sport. It makes sense, though, from GitHub, Slack, to Jira and Sentry; our world revolves around robots telling us everything is important, critical, and urgent. Just like that, it’s near impossible to see what actually matters so you can solve quicker and more comprehensively.

What is Hyperconverged Infrastructure?

Hyperconverged Infrastructure is a unified system that combines computer network and storage in one easy way to manage virtualized systems. To give you a brief understanding, these systems have two major components hypervisors and storage controllers. To elaborate further, typically the hyper converged systems are available as fully integrated hardware appliances and a standalone software. The question now arises how does it work?

Grafana, Loki, and Tempo will be relicensed to AGPLv3

Grafana Labs was founded in 2014 to build a sustainable business around the open source Grafana project, so that revenue from our commercial offerings could be re-invested in the technology and the community. Since then, we’ve expanded further in the open source world — creating Grafana Loki and Grafana Tempo and contributing heavily to projects such as Graphite, Prometheus, and Cortex — while building the Grafana Cloud and Grafana Enterprise Stack products for customers.

Q&A with Grafana Labs CEO Raj Dutt about our licensing changes

When Grafana Labs CEO and co-founder Raj Dutt announced to the team that the company would be relicensing our core open source projects from Apache 2.0 to AGPLv3, he opened the floor for discussion and encouraged anyone who had further questions to reach out. We believe in honesty and transparency, so we collected hard questions from Grafanistas, and Raj answered them for this public Q&A. The time felt right. As I’ve said publicly before, I’ve been thinking about this topic for years.

Using Coralogix + StackPulse to Automatically Enrich Alerts and Manage Incidents

Keeping digital services reliable is more important than ever. When something goes wrong in production, on-call teams face significant pressure to identify and resolve the incident quickly – in order to keep customers happy. But it can be difficult to get the right signals to the right person in a timely fashion.

DHCP server monitoring made easy with OpUtils

In today’s complex IT infrastructures, Dynamic Host Configuration Protocol (DHCP) servers play an indispensable role in automating IP allocation and configuration. A DHCP server’s capacity to allocate IPs to the requesting clients in real-time is one of the factors that ensures constant uptime of dynamic networks. However, even though a network’s availability depends on them, DHCP servers are often not closely monitored by IT teams.

InfluxDB's Checks and Notifications System

InfluxDB 2.0’s Checks and Notifications system is likely the most powerful and flexible system available for creating alerts based on time series data. To get the most out of the system, it is helpful to understand the different pieces and how they fit together. After reading this article, you should be able to create precise alerting using the InfluxDB 2.0 User Interface (UI), as well as be able to extend and customize the system to suit your specific needs.

Using Telegraf to Collect Infrastructure Performance Metrics

Telegraf is a server-based agent for collecting all kinds of metrics for further processing. It’s a piece of software that you can install anywhere in your infrastructure and it will read metrics from specified sources – typically application logs, events, or data outputs.