Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Observabilty for complex systems and related technologies.

Chaos testing a Postgres cluster managed by CloudNativePG

As more organizations move their databases to cloud-native environments, effectively managing and monitoring these systems becomes crucial. According to Coroot’s anonymous usage statistics, 64% of projects use PostgreSQL, making it the most popular RDBMS among our users, compared to 14% using MySQL. This is not surprising since it is also the most widely used open-source database worldwide.

Implementing High-Cardinality Instrumentation in Frontend Apps

As the Product Manager for Honeycomb’s new frontend product, Honeycomb for Frontend Observability, I’ve had the joy this past year of speaking to dozens of frontend engineering teams about observability. Many frontend teams come from worlds where they either rely on QA and customer reports to identify issues in production, or they use real use monitoring (RUM) and error monitoring tools to catch the most egregious issues.

The importance of understanding and observing an application's middle-tier components

Just like how the filling makes a sandwich, an application's performance is closely tied to how effectively its middle-tier components function. While the front-end is what users see and interact with (UI), and the back-end deals with data storage, the middle tier forms the vital core where the real magic happens—processing, logic implementation, and enforcement of business rules.

How to Use Static Thresholds for Effective Alerts in Splunk Observability Cloud

In this video, we explore the concept of static thresholds, which are a foundational tool in your observability alerting solution. You’ll learn: Additionally, we will demonstrate static thresholds in Splunk Observability Cloud. We’ll configure a static threshold for AWS EC2 memory utilization. We’ll also look at additional threshold settings like trigger sensitivity and duration. By the end of this video, you'll have the knowledge to effectively incorporate static thresholds into your observability strategy.

Enrich your on-call experience with observability data at your fingertips by using Datadog On-Call

The stress, sudden disruptions, and high stakes of resolving issues while on call is one of the most challenging aspects of an engineer’s job. Many organizations, from startups to large enterprises, still struggle with their on-call experience, which leads to longer resolution times and lower employee retention rates. Constant context switching, managing multiple tools, and racing against time to resolve issues can cause frustration, burnout, and inefficiency.

The Most Important Developer Productivity Metric

We love to talk about the value of observability in accelerating feedback loops by enabling teams to understand what changes they need to make to software. But a barrier that often holds teams back from completing the feedback loop is how long it takes to actually get feedback on code under development, or push code into production.

Optimize Observability and Cut Costs Without Losing Insights | What is Adaptive Telemetry? | Grafana

Managing telemetry can quickly spiral out of control, leading to ballooning costs and overwhelming data volumes. But what if you could save time, reduce costs, and maintain the critical insights your team relies on? In this video, learn how Adaptive Telemetry helps you: Sign up for a free Grafana Cloud account today and unlock the potential of distributed tracing in your performance testing workflow.

Top 13 Splunk Alternatives in 2025: From Open Source to Enterprise Solutions

Splunk is a powerful tool for data analysis and monitoring, but its high costs and complex implementation can be challenging for many organizations. Here are 13 proven Splunk alternatives that provide robust monitoring capabilities, comprehensive data analysis, and more cost-effective solutions for organizations of all sizes.