Operations | Monitoring | ITSM | DevOps | Cloud

Blog

I'm a VMware Admin: What Do I Do with Tanzu Kubernetes Clusters?

In previous blog posts, we’ve talked about the process of setting up vSphere with Tanzu (see our quick start guide) and creating your first Tanzu Kubernetes Cluster (TKC). As a vSphere Administrator, you might be saying to yourself, “This is cool and all, but what’s next? What’s an easy application to deploy?” The easiest target is the standard NGINX Kubernetes deployment, but that’s very basic.

Will XDR Change Incident Response?

Extended Detection and Response (XDR) is a new security technology that promises to change the way security organizations operate, and introduce important efficiencies to day-to-day processes. In particular, XDR is expected to have a huge impact on incident response teams. In this article, we’ll explain the basics of XDR, show how it addresses incident response challenges, and how it can transform traditional processes in the SOC.

Leveraging Multi-Tenancy For Your Team

Remote teams that work on dependent products face resource crunches and delays. When several teams work on interrelated projects, the chances are high that they will soon end up meeting at some point where they have to wait for resource release from another team or teams. For instance, when team A wants an update to an app currently maintained by Team B, they would want to automatically access the latest version of the app whenever a new bug is fixed and an update is made.

(Almost) Everything You Need to Know About SRE

Site Reliability Engineering (SRE) is a hot topic, but what exactly does it entail? And do you have to follow the principles to a T in order to achieve benefits from it? If you’re searching for answers to these common questions, look no further. In this episode of the Cloud & Culture podcast, VMware Tanzu’s Hannah Foxwell explains the what, why, and how of SRE—from key principles (such as SLI, SLO, and error budgets) to real-life examples of enterprise adoption.

The Nexthink Library: Q4 Overview

The Nexthink Library allows IT teams to extend and tailor their Nexthink Experience platform to cater to their specific business, IT and employee initiatives. By leveraging over 100+ out-of-the-box content packs and integration, users can use actionable content to overcome some of the most common IT challenges with ready-to-use dashboards, metrics, Engage campaigns, remote actions, targeted insights and more.

10 Elasticsearch Configurations You Have to Get Right

Elasticsearch is an open source, distributed document store and search engine that stores and retrieves data structures. As a distributed tool, Elasticsearch is highly scalable and offers advanced search capabilities. All of this adds up to a tool which can support a multitude of critical business needs and use cases. To follow are ten of the key Elasticsearch configurations are the most critical to get right when setting up and running your instance.

Boosting Monitoring Efficacy for Telecom via Correlation Analysis

In the first and second part of the blog series, we discussed the importance of correlation analysis in root cause analysis in general and in the context of promotional marketing. In a similar vein, in this blog we walk through how to leverage correlation analysis to address challenges in the telecom space.

Where is Your Next Release Bottleneck?

A typical modern DevOps pipeline includes eight major stages, and unfortunately, a release bottleneck can appear at any point: These may slow down productivity and limit a company’s ability to progress. This could damage their reputation, especially if a bug fix needs to be immediately deployed into production. This article will cover three key ways using data gathered from your DevOps pipeline can help you find and alleviate bottlenecks in your DevOps pipeline.

Explore your data effortlessly with the Datadog Clipboard

When investigating a complex system—or learning about it for the first time—you need to explore metrics, traces, logs, and other kinds of data. But as you navigate across different views of your data in dashboards, alert notifications, flame graphs, and so on, it can be hard to keep track of what you have already seen. When a potential issue comes up and time is tight, the last thing you need is to spend time remembering a crucial graph or finding the right browser tab.