Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Redundancy vs. Resiliency in IT: What's The Difference?

Redundancy and resiliency are both important factors for keeping things running smoothly in many industries. For example: Even small businesses, like home-based operations or mom-and-pop shops, should think about redundancy and resiliency to avoid disruptions in their day-to-day work. While researching for this article in my home office, my internet service went out and stayed out for a couple of hours.

Fusion Teams: What Are They?

With more organizations becoming tech-enabled to tackle the AI boom, a new term has emerged: the fusion team. At least 84% of companies and 59% of government entities have set up “fusion teams," according to Gartner data. A new concept coined by Gartner, the fusion team aims to encourage collaborative development among technology and business teams. But what exactly is a fusion team, and why is it becoming increasingly important in today's business landscape?

New GenAI Search Revamps Customer Experience

Splunk has launched a GenAI summary feature in splunk.com and docs.splunk.com search platforms designed to give users a quick and accurate glance of the most pertinent information they are looking for. This GenAI feature serves up a contextual high-level summary pulled from various relevant search results on topics ranging from Splunk product and feature usage to general Splunk terminology.

Observability Meets Security: Build a Baseline To Climb the PEAK

When we hunt in new environments and datasets, it is critical to build an understanding of what they contain, and how we can leverage them for future hunts. For this purpose, we recommend the PEAK Threat Hunting Framework's baseline hunting process.

What Is Five 9s in Availability Metrics?

What comes to mind when you hear that an IT component has “five 9s availability”? Five 9s availability of >= 99.999% is the peak metric for IT availability. Five 9s predicts that a measured component — whether it is a server, communication line, app, service, or any other item — will be available at least 99.999% of the time during a specific period.

Splunk Named a Leader in the Gartner Magic Quadrant for Observability Platforms

"Transformative Solution" says a Director of IT in a $30B+ retailer. "Best Monitoring and Observability Tool > Splunk," is how a software engineer in a software company labels it. These are only a couple of the terms our customers use when describing the value they are getting from Splunk. With these descriptions in mind, we are elated that Splunk has been named a Leader in the 2024 Gartner Magic Quadrant for Observability Platforms for the second year in a row in this category.

Unlock the Value of Cloud: Introducing Splunk Cloud Value Calculator

In the rapidly evolving digital landscape, organizations are increasingly turning to the cloud powered with AI capabilities to enhance efficiency, scalability and innovation. Splunk, a leader in security and data observability, has been at the forefront of this transformation.

Setting up and Understanding OpenTelemetry Collector Pipelines Through Visualization

Observability provides many business benefits, but comes with costs as well. Once the (not-insignificant) work of picking a platform, taking an inventory of your applications and infrastructure, and getting buyin from leadership (both from the business and engineering sides of the house) is done, you then have to actually instrument your applications to emit data, and build the data pipeline that sends that data to your observability system.

Chaos Testing Explained

Chaos testing is a part of site reliability engineering (SRE). In chaos testing, we intentionally break things in and around a given application, in order to: The purpose of chaos testing is to assess how software systems respond to scenarios like network outages, hardware failures, database failures, and server or cluster node failures in the infrastructure.