Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Log Management, Log Analytics and related technologies.

The AI Monitoring crisis that no one's talking about

When I spoke at AWS London earlier this year, I had the chance to discuss something that more and more teams are starting to feel: traditional observability doesn’t cut it for AI systems. In AI, “Is it running?” is no longer enough. We have to ask, “Is it right?” When I delivered that line, I saw the heads nodding. Everyone’s excited to build with LLMs, but when it comes to actually monitoring them in production? That’s where things fall apart.

How Payconiq Centralized Monitoring and Enabled Real-Time Insights with Elastic

Yannick Boulleys, Head of Platform at Payconiq, shares how Elastic helped the company consolidate fragmented monitoring tools into a single platform. With real-time user monitoring, built-in anomaly detection, and GenAI-powered root cause analysis, Elastic has transformed how Payconiq manages system visibility, consumer behavior, and cost efficiency, without requiring deep technical expertise.

Unlock Deeper Insights: Introducing GitLab Event Integration with Mezmo

Following the popularity of our existing GitHub integration, we’ve extended similar capabilities to GitLab users. You can now ingest GitLab events directly into Mezmo Telemetry Pipelines and route them to any destination. This provides a powerful new way to monitor, alert, and react to activity within your GitLab repositories.

Query and Analyze Logs Visually, Without Writing LogQL

It’s 2 AM. An incident’s in progress. Error rates are climbing. You jump into the logs, filter by service, adjust the time window… and now you need a LogQL query. You write one. It errors out. You fix the syntax, try again, only to realize you need a different filter or a new aggregation. Back to rewriting. By the time you’ve got the query right, you’ve already lost 10–15 minutes. The system is still broken, and you still don’t know why.

Elasticsearch is a recommended vector database in the NVIDIA Enterprise AI Factory validated design

Elastic now integrates with the NVIDIA Enterprise AI Factory validated design to provide users with a recommended vector database for their on-premises AI Factories. The validated design provides enterprises with a framework for building and deploying AI Factories on-premises.

MCP Server on Splunk Cloud Platform Demo

Discover the future of data interaction! This video introduces the Model Context Protocol (MCP) server on Splunk Cloud Platform, a groundbreaking capability that seamlessly connects your Splunk data with advanced AI models (LLMs). Learn how to leverage natural language to query, analyze, and manage your Splunk environment without complex SPL. In this comprehensive setup and configuration guide, we'll walk you through.

Kibana Logs: Advanced Query Patterns and Visualization Techniques

Kibana gives you a structured way to explore log data indexed in Elasticsearch. With the right queries and visualizations, you can identify anomalies, debug issues more quickly, and track trends across services. This blog covers practical ways to query logs using Kibana’s Lucene and KQL syntax, build visualizations that surface meaningful signals, and set up dashboards for ongoing log-based monitoring.

Build Log Automation with Last9's Query API

Manual log investigation is one of those engineering tasks that quietly drains hours without offering much real value. You're debugging an incident. Monitoring shows elevated error rates. Now begins the familiar drill: It’s a tedious cycle, and it doesn’t scale. The whole process breaks down when you’re trying to automate incident response, run continuous security monitoring, or generate compliance reports.

How to Troubleshoot Outages Faster Using Elastic Observability [2 Min Live Demo]

In this video, I’ll show you how Elastic Observability helps you reduce downtime, accelerate root cause analysis, and unify logs, metrics, and traces in one powerful dashboard. With native OpenTelemetry support, AI-powered troubleshooting, and built-in anomaly detection, you can streamline your workflows and boost service reliability.