
The Complete Guide to Metrics, Monitoring & Alerting

Monitoring your systems and infrastructure is critical to ensuring the performance of your services. In fact, as software development moves faster and faster, alerting and monitoring become indispensable practices for modern DevOps teams. Why is that exactly? That’s what I’m going to discuss today.

How blocks storage in Cortex reduces operational complexity for running Prometheus at massive scale

Cortex is a distributed long-term storage system for Prometheus. It provides horizontal scalability, high availability, multi-tenancy, and blazing-fast query performance when querying high-cardinality series or large time ranges. Today, there are massive Cortex clusters storing tens to hundreds of millions of active series with a 99.5th percentile query latency below 2.5s.

Global Energy Leader Transforms Technology and Culture with Kubernetes

When your company is born in the first Industrial Revolution, how do you stay relevant in the digital age? For Schneider Electric, the answer is continuous innovation, driven by its heritage in the electricity market. Founded in the 1880s, Schneider Electric is a leading provider of energy and automation digital solutions for efficiency and sustainability.

Why Netdata is free

When I first started the project that became the Netdata Agent, I was trying to solve a painful, real-world problem: IT infrastructure monitoring tools fell short when it came to providing complete, granular metrics in real time. Believe me, I had no shortage of tools to choose from, but each of them lacked something, either the ability to see every metric I needed or the ability to see it at the frequency required.

Introducing the all-new Netdata Cloud

Netdata Cloud works differently from other monitoring solutions. Most solutions limit the number and frequency of metrics because they rely on architectures that aggregate data. Netdata Cloud, however, streams limited metadata from each node running the Netdata Agent, keeping you in control of the data on your systems. The advantage of this architecture is that there is no limit to the number or frequency of metrics, regardless of the scale or complexity of your IT infrastructure.

Observability Across the Development Lifecycle: A Convo with Andre Boutet of OneSpan

At OpenObservability, we had the pleasure of sitting down with Andre Boutet, the Senior Director of Cloud Operations and Services for OneSpan. Andre had a conversation with our CTO, Jonah Kowall, about what observability means to his team and his organization. Teaser: It’s not just about ensuring uptime and availability for external systems. It’s a philosophy founded on supporting the entire development lifecycle.

CloudOps with OpsRamp: From Discovery to Resolution

Are you able to observe, monitor, and act upon cloud assets, wherever they are? In this Tech Talk, we'll discuss the role of cloud operations and review OpsRamp's unique capabilities for hybrid infrastructure discovery, monitoring, service maps, remote consoles, and remediation. The talk will showcase the collective value of a unified IT operations management platform by reviewing multiple OpsRamp features and tying them to the most common CloudOps workflows.