Centralized logging is a critical component of observability into modern infrastructure and applications. Without it, it can be difficult to diagnose problems and understand user journeys—leaving engineers blind to production incidents or interrupted customer experiences. Alternatively, when the right engineers can access the right log data at the right time, they can quickly gain a better understanding of how their services are performing and troubleshoot problems faster.
Using Cribl Stream for observability is a given, but what about using Cribl Stream to get MORE from your data? Observability is all about being able to collect, route, store, and search your data. Implementing enrichment with observability provides more context and elevates your ho-hum data to robust information. This is key to faster, more confident decision-making!
In a prior blog post, we discussed the basics of Kubernetes Limits and Requests: they serve an important role to manage resources in cloud environments. In another article in the series, we discussed the Out of Memory kills and CPU throttling that can affect your cluster. But, all in all, Limits and Requests are not silver bullets for CPU management and there are cases where other alternatives might be a better option.
APIs have become indispensable to modern digital businesses, not least because of their flexibility and ease of integration.
Last year, I wrote How We Define SRE Work. This article described how I came up with the charter for the SRE team, which we bootstrapped right around then. It’s been a while. The SRE team is now four engineers and a manager. We are involved in all sorts of things across the organization, across all sorts of spheres. We are embedded in teams and we handle training, vendor management, capacity planning, cluster updates, tooling, and so on.
Amazon Web Services (AWS) recently announced the release of Amazon Linux 2023 (AL2023) as the next generation of Amazon Linux with enhancements to its already-proven reliability. Besides offering frequent updates and long-term support, AL2023 provides a predictable release cadence, flexibility, and control over new versions. It also eliminates the operational overhead that comes with creating custom policies to meet standard compliance requirements.
Popular computer magazine PC Pro added WhatsUp Gold to its "A List" of products for advanced features and ease and affordability of licensing, awarding it five stars in its review. WhatsUp Gold, first launched in 1996, has been growing in features and refinements for the past 27 years. More recently, the IT infrastructure monitoring (ITIM) solution gained capabilities another way.
We talk a lot about intermittent network problems. There’s a reason for that. Many network problems are intermittent, which means that they are sporadic, and much more difficult to pinpoint and troubleshoot than constant network problems. Picture this: You're in the middle of an important video conference call with a potential client when suddenly your network connection drops, leaving you stranded and looking like a pixelated mess on the other end. Sound familiar?