
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Long-Term Data Storage and Retention in Netdata

Netdata’s database engine (dbengine) provides a sophisticated multi-tiered storage system designed for efficient long-term data retention while maintaining high granularity. This article explores the technical details of how Netdata handles metric storage, the advantages of its distributed architecture, and how to configure it for your specific needs.

What is DDI? Meaning, Features & Benefits

As a network administrator, having full visibility and control over your network infrastructure is critical. However, managing core network services like DNS, DHCP, and IP addresses can become complex, especially as your network grows. This is where DDI comes in. DDI (DNS, DHCP, IP Address Management) solutions integrate these essential networking functions into a single, centralized management platform.

IT Inventory Management

You can’t monitor, protect, or fix what you don’t know. That simple concept helps explain why IT inventory management is the cornerstone of effective IT and security ops. However, given the highly distributed and dynamic nature of modern networks, maintaining an up-to-date inventory can be challenging. Modern IT assets are everywhere, from corporate data centers to third-party clouds to coffee shops where remote workers stop for a snack.

Demystifying the OpenTelemetry Operator: Observing Kubernetes applications without writing code

The promise of observing your application without writing code (i.e., auto-instrumentation) is not new, and it’s extremely compelling: run a single command in your cluster and suddenly application telemetry starts arriving at your observability backend. What else could you ask for? The OpenTelemetry Operator aims to fulfill that dream for Kubernetes environments by using a set of well-known patterns such as operators and custom resources.

Keys to Success: Three AIOps Best Practices

When IT operations run smoothly, it’s more likely everything else in the organization will as well. Unfortunately, tech sprawl can make IT environments more prone to issues that hinder end users or, worse, customers. Recent research shows that up to 50% of organizations juggle multiple tools for observability. Too many disparate tools monitoring too many systems and applications create silos, slowing incident response and resolution times.

Sentry's Pinia Integration for Vue and Nuxt Error Tracking

When debugging issues in production, context is everything. While Sentry already provides rich error data like stack traces, breadcrumbs, and user information, understanding the application state at the time of an error can still help you reproduce the issue and ship a fix quickly. Sentry’s Pinia integration solves this by automatically capturing Pinia state whenever an error occurs. Now you get the complete picture of your Vue or Nuxt application's state at the moment things went wrong.

Optimizing RabbitMQ Performance: The Metrics That Matter

RabbitMQ is a powerful, reliable, and widely used message broker that forms the backbone of modern microservices architectures. However, ensuring its performance and reliability requires proactive monitoring of key metrics. In this blog, we will explore the essential RabbitMQ metrics, their units, possible issues, solutions, and how tools like Atatus can simplify monitoring and troubleshooting.

TikTok Emerges from Shutdown Without Bytedance's US CDN

Kentik’s Doug Madory looks into this weekend’s 14-hour outage of popular video sharing service TikTok, which was slated to be banned from the US per recent legislation. While TikTok came back, it is notably no longer being served by parent company Bytedance’s US CDN. We delve into the traffic statistics in this blog post.

How to Build a Cloud Strategy That Works for Your Business

As technology advances at lightning speed, more and more businesses are turning to the cloud to boost growth, improve efficiency, and stay ahead of the competition. However, creating a cloud strategy that matches your business goals, budget, and security needs can be tricky. It’s not just about switching to the cloud; it’s about using it wisely to get the most out of it.

Serilog: Configuration, Error Handling & Best Practices

When building modern .NET applications, logging is one of those things you don’t want to get wrong. Serilog steps in as a popular logging framework that has earned its spot as a go-to tool for developers. Why? Because it’s flexible, versatile, and does an awesome job of giving you clear insights into your app's behavior. But what exactly is Serilog?