Operations | Monitoring | ITSM | DevOps | Cloud

The Velocloud Network Monitoring Jigsaw Puzzle: Putting Your Network Pieces Together

Have you ever completed a jigsaw puzzle? If so, you know that it takes patience, focus, and attention to detail. Each piece plays a crucial role in completing the final picture, and missing even one piece can result in an incomplete and unsatisfactory result. In the same way, managing a network requires attention to detail and the ability to piece together all the different elements to ensure optimal performance.

Centralized Log Management Best Practices and Tools

Centralized logging is a critical component of observability into modern infrastructure and applications. Without it, it can be difficult to diagnose problems and understand user journeys—leaving engineers blind to production incidents or interrupted customer experiences. Alternatively, when the right engineers can access the right log data at the right time, they can quickly gain a better understanding of how their services are performing and troubleshoot problems faster.

What is an ESXi cluster, and how do you cluster ESXi servers

ESXi clusters involve a combination of ESXi hosts, VMware services, and vCenter to optimize load balancing, availability, and resource management for virtual machines (VMs). These clusters feature a vCenter server that centralizes the management process to facilitate shared resources that drive higher availability, scalability, and load-balancing capabilities.

MIAX and Cribl Stream: Enriching Data for Improved Observability and Faster Time to Value

Using Cribl Stream for observability is a given, but what about using Cribl Stream to get MORE from your data? Observability is all about being able to collect, route, store, and search your data. Implementing enrichment with observability provides more context and elevates your ho-hum data to robust information. This is key to faster, more confident decision-making!

Kubernetes CPU Requests & Limits VS Autoscaling

In a prior blog post, we discussed the basics of Kubernetes Limits and Requests: they serve an important role to manage resources in cloud environments. In another article in the series, we discussed the Out of Memory kills and CPU throttling that can affect your cluster. But, all in all, Limits and Requests are not silver bullets for CPU management and there are cases where other alternatives might be a better option.

How We Define SRE Work, as a Team

Last year, I wrote How We Define SRE Work. This article described how I came up with the charter for the SRE team, which we bootstrapped right around then. It’s been a while. The SRE team is now four engineers and a manager. We are involved in all sorts of things across the organization, across all sorts of spheres. We are embedded in teams and we handle training, vendor management, capacity planning, cluster updates, tooling, and so on.

Amazon Linux 2023: Why we're moving to AL2023

Amazon Web Services (AWS) recently announced the release of Amazon Linux 2023 (AL2023) as the next generation of Amazon Linux with enhancements to its already-proven reliability. Besides offering frequent updates and long-term support, AL2023 provides a predictable release cadence, flexibility, and control over new versions. It also eliminates the operational overhead that comes with creating custom policies to meet standard compliance requirements.

PC Pro Gives WhatsUp Gold Five Stars, Adding to Accolades

Popular computer magazine PC Pro added WhatsUp Gold to its "A List" of products for advanced features and ease and affordability of licensing, awarding it five stars in its review. WhatsUp Gold, first launched in 1996, has been growing in features and refinements for the past 27 years. More recently, the IT infrastructure monitoring (ITIM) solution gained capabilities another way.

How to Detect and Identify Intermittent Network Problems

We talk a lot about intermittent network problems. There’s a reason for that. Many network problems are intermittent, which means that they are sporadic, and much more difficult to pinpoint and troubleshoot than constant network problems. Picture this: You're in the middle of an important video conference call with a potential client when suddenly your network connection drops, leaving you stranded and looking like a pixelated mess on the other end. Sound familiar?