Operations | Monitoring | ITSM | DevOps | Cloud

Latest News

Navigating Industry-Specific Challenges in IT Infrastructure Monitoring and Automation

In navigating the intricate landscape of IT infrastructure, NiCE IT Management Solutions’ whitepaper delves deep, unraveling the historical tapestry shaping the contemporary technological terrain. It dissects industry-specific challenges, offers tailored insights into crafting monitoring and automation solutions, and spotlights key players’ pivotal roles.

The Benefits of Secure File Sharing for Your Business

In the digital age, businesses rely heavily on the exchange of information to function efficiently. However, with the increasing volume of data being shared both internally and externally, security concerns have become a paramount issue. Secure file sharing has emerged as a crucial solution to mitigate risks associated with unauthorized access, data breaches, and compliance violations. Implementing secure file-sharing mechanisms not only protects sensitive information but also enhances productivity and collaboration within the organization.

Shedding Light on Network Visibility: Don't Wait for End Users to Report Issues

Imagine driving a car at night without headlights. It's risky, right? Without them, you can't see where you're going, and you might crash. Well, the same goes for visibility in enterprise networks. If you can't see how they're performing, you might run into problems with important apps, slow Internet, and unhappy customers. That's why having the right tools to see what's happening on your network is crucial. It's like turning on those headlights to make sure you're going in the right direction.

The Ultimate CPU Alert - Reloaded, Again!

It’s been nearly ten years since “The Ultimate CPU Alert – Reloaded” and its Linux version were shared with the SolarWinds community. At that time, managing CPU data from 11,000 nodes, with updates every five minutes to a central MSSQL database, was a significant challenge. The goal was to develop alerting logic to identify when a server was experiencing high CPU usage accurately.

Navigating the VMware by Broadcom Acquisition

In November 2023, the technology landscape witnessed a monumental shift with Broadcom's acquisition of VMware for $69 billion. This strategic move not only redefines the contours of IT and cloud computing but also signifies a new era for the new giant. Broadcom's leap towards subscription-based services and hybrid cloud environments echoes a broader industry evolution towards more flexible, service-oriented architectures.

The Frugal Architect, Law IV: Unobserved Systems Lead To Unknown Costs

This is part four of seven in our Frugal Architect blog series. Read the rest of the series here. In case you weren’t as giddy as CloudZero was at re:Invent this past year, we wanted to recount the seven laws outlined by Werner Vogels, Amazon’s CTO, which he’s bundled into a framework called “The Frugal Architect” (check out the whole framework here). What is “The Frugal Architect”?

From MLOps to LLMOps: The evolution of automation for AI-powered applications

Machine learning operations (MLOps) has become the backbone of efficient artificial intelligence (AI) development. Blending ML with development and operations best practices, MLOps streamlines deploying ML models via continuous testing, updating, and monitoring. But as ML and AI use cases continue to expand, a need arises for specialized tools and best practices to handle the particular conditions of complex AI apps — like those using large language models (LLMs).

Finding the common ground with executives in incidents

I spotted this thread on Reddit, discussing the pains of executives dropping into incidents, and the corresponding impact it can have on the incident response process. Being an SRE community, it was a little more of a one-sided account of the situation. So let’s look a little closer, and dive into what it takes to make incidents better for responders and executives alike.