The latest news and information on log management, log analytics, and related technologies.

What Is AI Search? How It Improves Your Search Results [Quick Question Ep. 1]

In this brief episode, I explore how AI is revolutionizing the way we search. From understanding user intent to personalizing results, ranking, summarizing, and even multimodal search, AI is transforming every aspect of search relevance and performance. If you’ve ever wondered whether you really need all this context in search, the short answer is: absolutely. Watch to learn how AI search works and why it matters more than ever in the age of artificial intelligence.

How to Cut Observability Costs with Synthetic Monitoring and Responsive Pipelines

Platform teams are struggling with observability noise, bloated storage costs, and a lack of clarity during incidents. Most teams capture everything all the time, which produces expensive, overwhelming, and often unnecessary data volumes. In Telemetry for Modern Apps, Mezmo teamed up with Checkly to demonstrate how synthetic monitoring triggers and responsive telemetry pipelines can help reduce costs while maintaining the context needed during incidents.
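To make the idea of a responsive pipeline concrete, here is a minimal sketch in Python. It is not the Mezmo or Checkly API: the `ResponsivePipeline` class, its `sample_every` parameter, and the `on_synthetic_check` hook are all hypothetical names chosen for illustration. The assumption is that a failed synthetic check flips the pipeline from sampling to full-fidelity forwarding, and a passing check flips it back.

```python
from dataclasses import dataclass


@dataclass
class ResponsivePipeline:
    """Hypothetical pipeline stage: sample in normal operation, keep everything during incidents."""

    sample_every: int = 10  # assumption: keep 1 in N routine records
    incident: bool = False
    _seen: int = 0

    def on_synthetic_check(self, passed: bool) -> None:
        # A failing synthetic check switches the pipeline to full fidelity.
        self.incident = not passed

    def should_forward(self, record: dict) -> bool:
        # Errors and anything seen during an incident are always kept.
        if self.incident or record.get("level") == "error":
            return True
        # Otherwise keep only every Nth routine record.
        self._seen += 1
        return self._seen % self.sample_every == 0


pipeline = ResponsivePipeline(sample_every=5)
routine = [{"level": "info", "msg": f"request {i}"} for i in range(10)]

kept_normal = sum(pipeline.should_forward(r) for r in routine)

pipeline.on_synthetic_check(passed=False)  # simulated failed check
kept_incident = sum(pipeline.should_forward(r) for r in routine)
```

In this sketch the cost saving comes from the sampling branch, while the incident branch preserves context exactly when it is needed. A real pipeline would likely also buffer recent records so that context from just before the failure is not lost.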

Six platform updates giving you time back in your day

Ever look at your to-do list at the end of the day and realize it’s grown longer, not shorter? We get it—there’s always more to do and never enough time. But if you’re a Sumo Logic user, reading this blog will be a win for your day because we’re giving you six ways to slash the time you spend on tasks in your platform.

From Sequential Bottlenecks to Concurrent Performance: Optimizing Log Processing at Scale

We optimized our log processing pipeline by moving from sequential to concurrent processing at the entry level, achieving 30% higher throughput and better resource utilization without increasing infrastructure costs. When customers start sending millions of logs per minute, you quickly discover whether your processing pipeline can actually scale vertically.
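As a rough illustration of moving from sequential to concurrent processing at the entry level, the sketch below processes each log entry independently using Python's standard `concurrent.futures` module. It is not the authors' implementation: `parse_entry` is a hypothetical stand-in for per-entry work, and the worker count is an arbitrary assumption. The 30% figure above comes from the article and is not something this sketch reproduces.

```python
import json
from concurrent.futures import ThreadPoolExecutor


def parse_entry(raw: str) -> dict:
    """Hypothetical per-entry work: parse one JSON log line and normalize its level."""
    record = json.loads(raw)
    record["level"] = record.get("level", "info").upper()
    return record


def process_sequential(lines: list[str]) -> list[dict]:
    # Baseline: one entry at a time.
    return [parse_entry(line) for line in lines]


def process_concurrent(lines: list[str], workers: int = 4) -> list[dict]:
    # Each entry is handled independently; Executor.map preserves input order.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(parse_entry, lines))


lines = ['{"msg": "disk full", "level": "warn"}', '{"msg": "started"}']
sequential = process_sequential(lines)
concurrent_result = process_concurrent(lines)
```

Threads help most when per-entry work waits on I/O such as enrichment lookups or network writes. For CPU-bound parsing in CPython, a `ProcessPoolExecutor` or a runtime without a global interpreter lock would be the more likely choice.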

IT Service Performance Monitoring: Key Metrics, Best Practices, and Future Trends

As organizations rely more on complex IT systems and cloud-based services, keeping everything running smoothly — and reliably — has become a top priority. That’s where IT service performance monitoring comes in, giving teams the visibility they need to make sure systems stay healthy and responsive. By tracking a range of technical and user-focused metrics, businesses can quickly identify and address issues before they impact operations or end users.

The AI Monitoring crisis that no one's talking about

When I spoke at AWS London earlier this year, I had the chance to discuss something that more and more teams are starting to feel: traditional observability doesn’t cut it for AI systems. In AI, “Is it running?” is no longer enough. We have to ask, “Is it right?” When I delivered that line, I saw the heads nodding. Everyone’s excited to build with LLMs, but when it comes to actually monitoring them in production? That’s where things fall apart.

Query and Analyze Logs Visually, Without Writing LogQL

It’s 2 AM. An incident’s in progress. Error rates are climbing. You jump into the logs, filter by service, adjust the time window… and now you need a LogQL query. You write one. It errors out. You fix the syntax, try again, only to realize you need a different filter or a new aggregation. Back to rewriting. By the time you’ve got the query right, you’ve already lost 10–15 minutes. The system is still broken, and you still don’t know why.