%term

The latest News and Information on Log Management, Log Analytics and related technologies.

What Is APM? A Guide to Application Performance Monitoring

May 13, 2026 By Coralogix Team In Coralogix

A well-instrumented service tells your on-call engineer which deploy broke checkout, which span ate the latency budget, and which line to revert before the support queue fills up. Getting there depends on how cleanly your application performance monitoring layer turns telemetry into answers. The sections ahead walk through how APM works, the metrics and components worth tracking, the cloud-native challenges at scale, and how to evaluate APM tooling against your real workload.

Read Post

Coralogix

Read more about What Is APM? A Guide to Application Performance Monitoring

What Is an Incident Commander? Role, Skills, and Best Practices

May 13, 2026 By Coralogix Team In Coralogix

The fastest incident response teams treat coordination as a craft. Someone owns the call, drives the decisions, and keeps everyone moving in the same direction while the team puts the system back together. That person is the incident commander (IC), and getting the role right is what separates your 15-minute fix from a four-hour war room where nobody’s sure who’s making the call.

Read Post

Coralogix

Read more about What Is an Incident Commander? Role, Skills, and Best Practices

Contributing Distributed Partition Ownership to the Azure Event Hub Receiver

May 12, 2026 By Dylan Strohschein In ObservIQ

If you're running OpenTelemetry collectors against Azure Event Hubs, distributed partition ownership and checkpointing just got significantly better. Your fleet now self-organizes. Failover is automatic. Restarts don't lose data. Here's how we got here.

Read Post

ObservIQ

Read more about Contributing Distributed Partition Ownership to the Azure Event Hub Receiver

OpenTelemetry Fleet Management: Scalable Control

May 12, 2026 By Coralogix In Coralogix

OpenTelemetry has turned observability pipelines into production infrastructure, but managing them at scale often creates a massive operational burden. In this demo, we show how Coralogix Fleet Management acts as the central control plane for your OTel ecosystem, providing the governance and orchestration required for modern DevOps. Stop the "manual marathon" of PRs and Helm upgrades. Move toward a safer, more predictable operating model where telemetry is consistent, audited, and scalable.

View Video

Coralogix

Read more about OpenTelemetry Fleet Management: Scalable Control

The Best Kubernetes Monitoring Tools of 2026

May 11, 2026 By Libi Michelson In logz.io

Effective Kubernetes monitoring in 2026 is critical due to increased cluster scale and microservices complexity, demanding a shift toward unified observability (logs, metrics, and traces). The core focus is leveraging AI-driven features to automate anomaly detection, correlate diverse data, and significantly reduce Mean Time to Recovery (MTTR).

Read Post

logz.io

Read more about The Best Kubernetes Monitoring Tools of 2026

AURA in Practice: Mezmo's SRE bot, demo walkthrough

May 11, 2026 By Mezmo In Mezmo

A walkthrough of the Slack-based SRE bot Mezmo's engineering team built on AURA, the open-source agent harness, running against Mezmo's own production tooling. Adrian Furlong shows the bot answering questions in a DM with tool calls visible inline, then in a shared channel where it reads the conversation before responding. He opens a fresh PagerDuty incident on camera. The webhook fires AURA, and within seconds, the agent posts a triage note back on the incident and a structured analysis in the dedicated incident channel.

View Video

Mezmo

Read more about AURA in Practice: Mezmo's SRE bot, demo walkthrough

Managing OpenTelemetry at Scale: Why OTel Pipelines Need a Control Plane

May 10, 2026 By Jonny Steiner In Coralogix

OpenTelemetry made telemetry possible everywhere – turning observability pipelines into distributed production infrastructure. Distributed infrastructure requires a control plane for inventory, governance, and safe change. At 500 collectors across hybrid environments, operational overhead becomes a production risk. The moment telemetry pipelines become a distributed infrastructure, they inherit the operational problems of one.

Read Post

Coralogix

Read more about Managing OpenTelemetry at Scale: Why OTel Pipelines Need a Control Plane

From noise to knowledge: How GenAI is revolutionizing log management and analytics

May 8, 2026 By Elastic Observability Team In Elastic

Focusing on GenAI and logs for IT efficiency Efficiency is everything for managing today’s digital systems. Technology is constantly transforming and expanding operations are driving an explosion in data. Consequently, data ingest and storage costs have soared. But it’s not just storage data costs that keeps teams behind.The challenge of managing all that observability data forces IT teams to choose between efficiency and the bottom line.

Read Post

Elastic

Read more about From noise to knowledge: How GenAI is revolutionizing log management and analytics

Federated Search | From Silos to Insight | AWS S3 Schema Discovery with Splunk-Managed Tables

May 8, 2026 By Splunk In Splunk

This walk-through shows how Splunk's crawler, available through the Data Management app, can discover schema and partition keys for S3 backed datasets and create Splunk managed catalog tables. Once the data is mapped, analysts can search AWS S3 data through Splunk and bring it into broader security, observability, and operational workflows.

View Video

Splunk

Read more about Federated Search | From Silos to Insight | AWS S3 Schema Discovery with Splunk-Managed Tables

The Journey to Production AI: Five Steps for SRE and Platform Teams

May 8, 2026 By Mezmo In Mezmo

In a recent webinar, The Journey to Production AI, Andre Elizondo walked through what separates a working agent demo from an agent worth trusting on a 2 a.m. page. Live polls during the session put numbers behind a pattern most platform teams already feel. ‍ ‍ Most teams are early. The ones who are further along did not get there by shipping a flashier demo. They got there by treating production AI as a platform problem.

Read Post