Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Observabilty for complex systems and related technologies.

Top 10 Kubernetes Alternatives to Consider in 2025

Organizations exploring Kubernetes alternatives often face a critical decision when choosing the right container orchestration solution. While Kubernetes has established itself as the industry standard, companies are increasingly seeking alternatives that better align with their deployment needs, team expertise, and operational requirements. This comprehensive guide examines the top alternatives to Kubernetes, helping you make an informed decision for your 2025 container strategy.

2025 observability predictions and trends from Grafana Labs

From AI to eBPF, 2024 reshaped the observability landscape. As we peer into 2025, Grafana Labs’ experts predict another year of innovation that will redefine how teams understand and optimize their systems, from profiling to platform engineering. Their insights align with what the community is saying, according to early responses from our third annual Observability Survey. Do you agree or disagree with the trends our team believes will transform the world of observability next year?

From Gartner IOCS 2024 Conference: AI, Observability Data, and Telemetry Pipelines

Last week, I attended one of the last conferences of the year with team Mezmo: the Gartner IT Infrastructure, Operations & Cloud Strategies Conference in Las Vegas. Not surprisingly, there were over 20 sessions covering observability and how it is getting increasingly critical in the new complex distributed computing environment. Of course, there were many sessions, including all keynotes that addressed the advent and impact of AI on IT operations and observability.

The Next Generation of AI-Powered Observability

AI is changing our world, and its impact on observability is no different. This article discusses some of the components of a good observability platform, how AI is well-positioned to revolutionize observability, and how Lumigo Copilot Beta will provide substantial value to customers and partners.

AI Log Analysis - Shaping the Future of Observability

As digital applications and infrastructures grow increasingly complex, managing and understanding log data has become increasingly vital in achieving practical observability, enabling organizations to detect, diagnose, and prevent issues across their systems. However, traditional log analysis methods often struggle with the volume and complexities of modern log data in cloud-native environments.

Our team's learnings from Kubecon: Use Exemplars, Configuring OTel, and OTTL cookbook

A few weeks ago, members of Mezmo were at Kubecon and attended several sessions. You can see a post with my recap and session highlights. Today, though, I’m going to discuss three sessions that my colleagues found interesting for our peers in Observability.

Scaling Observability on a Budget with Cribl for State, Local, and Education

Over the past year, I’ve noticed some interesting trends in my work with state and local governments. Across my conversations with organizations in this space, there’s a common thread: teams are getting creative about maximizing their limited resources. With budgets either flat or shrinking and operational demands increasing, these teams face tough choices. They’re being asked to maintain or improve services while working with the same, or in some cases, fewer resources than before.

Reduce MTTD+MTTR and Improve User Experience with Observability - Customer Brown Bag - Dec 12, 2024

Please join us as Technical Account Engineer, Duncan McKendrick, teaches how Sumo Logic's observability platform empowers teams to minimize Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR) while enhancing the overall user experience. Learn how to leverage real-time insights, streamline incident response, and ensure optimal application performance through actionable data.

Observability in the Age of AI

This post was written by Charity Majors and Phillip Carter. In May of 2023, we released the Honeycomb Query Assistant, an LLM-backed feature that lets engineers use natural language to generate and execute queries against their telemetry data. Instead of having to master a domain-specific query language, you can simply type in things like “slow endpoints by status code” and the Query Assistant will generate a relevant Honeycomb query for you to iterate on.