Operations | Monitoring | ITSM | DevOps | Cloud

Logging

The latest News and Information on Log Management, Log Analytics and related technologies.

Swift: Transforming product instrumentation with Elastic Observability

As the leading global provider of secure financial transactions and payments, it's vital for SWIFT to stay relevant. With more than 45 million messages flowing through its systems every day and being at the heart of the financial industry, SWIFT is at the forefront of secure, frictionless financial services including sanctions screenings, compliance analytics, KYC (Know Your Customer) registry, and payment controls.

The Ultimate CPU Alert - Reloaded, Again!

It’s been nearly ten years since “The Ultimate CPU Alert – Reloaded” and its Linux version were shared with the SolarWinds community. At that time, managing CPU data from 11,000 nodes, with updates every five minutes to a central MSSQL database, was a significant challenge. The goal was to develop alerting logic to identify when a server was experiencing high CPU usage accurately.

Webinar Recap: Myths and Realities in Telemetry Data Handling

Telemetry data is growing exponentially, but the business value isn’t increasing at a similar pace. Getting the right telemetry data is hard, so I recently had a conversation with Matt Aslett, Director of Research at Ventana Research, now a part of ISG, about five myths and realities in telemetry data handling.

Turning Logs into Metrics with OpenTelemetry and BindPlane OP

Turning logs into metrics isn’t a new concept. A version of this functionality is implemented in most agents, visualization tools, and backends. It’s everywhere because converting logs to metrics has many practical applications and is one of the fundamental mechanisms for controlling log volume in a telemetry pipeline. In this post, I’ll briefly overview log-based metrics, explain why they matter, and provide examples of how to build them using OpenTelemetry and BindPlane OP.

Maximize IT efficiency leveraging alert management with Elastic AI Assistant for Observability

Manage and correlate signals and alerts in Elastic Observability As organizations embrace increasingly complex and interconnected IT systems, the sheer volume of alerts generated by diverse monitoring tools has given rise to a critical challenge — how do we efficiently sift through the noise to identify and respond to the most crucial issues? Event management and correlation are two indispensable pillars in the realm of IT service management.

Mastering Log Retention Policy: A Guide to Securing Your Data

The strategic implementation of a security log retention policy is critical for safeguarding digital assets and key company data. This practice is foundational for detecting and analyzing security threats in real-time and conducting thorough post-event investigations. Integrating the nuances of log analytics system costs, which escalate with data volume due to the infrastructure needed for storage and processing, highlights a critical aspect of security log retention.

Grafana Cloud updates: cool visualizations, log monitoring made easier, simplified alert routing

We are consistently releasing helpful updates and fun features in Grafana Cloud, our fully managed observability platform powered by the open source Grafana LGTM Stack (Loki for logs, Grafana for visualization, Tempo for traces, and Mimir for metrics). In case you missed it, here’s a roundup of the latest and greatest upgrades for Grafana Cloud this month. If you’re not a Grafana Cloud user, what are we waiting for?

Receive Cribl Notifications on a Distribution List or Group Email Alias

IT and security teams have several products they use and in turn, have many admins. Some have wide privileges, while others have focused responsibilities for the various tools and touch points in an IT and security data path. Not all admins are members of all tools. But they are all typically part of a larger group bound by an email alias (aka a distribution list).