Latest Posts

Break Silos and Foster Collaboration with DevOps

Apr 27, 2022 By Stephen Watts In Splunk

DevOps is a well established discipline. By now, most developers, IT engineers and site reliability engineers (SREs) have heard all about the importance of “breaking down silos” and achieving seamless communication and collaboration across all stakeholders in the continuous integration/continuous delivery (CI/CD) process — which extends from source code development through production environment management and incident response.

Read Post

Splunk

Read more about Break Silos and Foster Collaboration with DevOps

Kubernetes Incident Response Best Practices

Apr 27, 2022 By Stephen Watts In Splunk

Inevitably, organizations that use technology (regardless of the extent) will have something, somewhere, go wrong. The key to a successful organization is to have the tools and processes in place to handle these incidents and get systems restored in a repeatable and reliable way in as little time as possible.

Read Post

Splunk

Read more about Kubernetes Incident Response Best Practices

CI/CD & DevOps Pipeline Analytics: A Primer

Apr 27, 2022 By Stephen Watts In Splunk

Tracking application-level and infrastructure-level metrics is part of what it takes to deliver software successfully. These metrics provide deep visibility into application environments, allowing teams to home in on performance issues that arise from within applications or infrastructure. What application and infrastructure metrics can’t deliver, however — at least not on their own — is breadth.

Read Post

Splunk

Read more about CI/CD & DevOps Pipeline Analytics: A Primer

Using AI & ML for Application Performance (APM)

Apr 26, 2022 By Collin Chau In Splunk

Today, IT and site reliability engineering (SRE) teams face pressure to remediate problems faster than ever, within environments that are larger than ever, while contending with architectures that are more complex than ever. In the face of these challenges, artificial intelligence has become a must-have feature for managing complex application performance or availability problems at scale.

Read Post

Splunk

Read more about Using AI & ML for Application Performance (APM)

Cloud Log Management Strategy & Best Practices

Apr 26, 2022 By Stephen Watts In Splunk

For IT Operations and Site Reliability Engineering (SRE) teams, logging is nothing new. In fact, collecting and analyzing logs is one of the oldest cornerstones of performance management. Logs have been part and parcel of APM workflows for decades. Yet the logging strategies that worked in eras past often fall short today. That’s thanks to the advent of cloud-native computing, which has ushered in fundamental new challenges in the way teams aggregate, analyze, and manage logs.

Read Post

Splunk

Read more about Cloud Log Management Strategy & Best Practices

Continuous Performance Management Basics

Apr 26, 2022 By Stephen Watts In Splunk

In this blog post, we will walk you through how to develop a continuous performance management playbook for your business to effectively integrate and manage performance as an integral part of your development process – enabling you to build fast experiences faster than your competition. Let’s dive in!

Read Post

Splunk

Read more about Continuous Performance Management Basics

Synthetic Monitoring for CI/CD Pipelines

Apr 25, 2022 By Stephen Watts In Splunk

For DevOps teams, delivering quality software has long required reconciling a major tension: In a perfect world, you’d catch every issue in each new release of your application before you deployed the release into production. But in the real world, doing so is tricky, not least because it’s hard to collect data about application performance before the application is actually deployed.

Read Post

Splunk

Read more about Synthetic Monitoring for CI/CD Pipelines

On-Premises Application Monitoring: An Introduction

Apr 25, 2022 By Stephen Watts In Splunk

In the present age of cloud-native everything, it can be easy to forget that some applications still run on-premises. But they do and managing the performance of on-premises apps is just as important as monitoring those that run in the cloud. With that reality in mind, here’s a primer on how to approach on-premises application performance monitoring as part of a broader cloud-native performance optimization strategy.

Read Post

Splunk

Read more about On-Premises Application Monitoring: An Introduction

Kubernetes Monitoring: An Introduction

Apr 25, 2022 By Stephen Watts In Splunk

One of the first things you’ll learn when you start managing application performance in Kubernetes is that doing so is, in a word, complicated. No matter how well you’ve mastered performance monitoring for conventional applications, it’s easy to find yourself lost inside a Kubernetes cluster.

Read Post

Splunk

Read more about Kubernetes Monitoring: An Introduction

Distributed Tracing Best Practices for Microservices

Apr 22, 2022 By Collin Chau In Splunk

The management of modern software environments hinges on the three so-called “pillars of observability”: logs, metrics and traces. Each of these data sources provides crucial visibility into applications and the infrastructure hosting them. For many IT operations and site reliability engineering (SRE) teams, two of these pillars — logs and metrics — are familiar enough.

Read Post

Splunk

Read more about Distributed Tracing Best Practices for Microservices

Operations | Monitoring | ITSM | DevOps | Cloud

Latest Posts

Break Silos and Foster Collaboration with DevOps

Kubernetes Incident Response Best Practices

CI/CD & DevOps Pipeline Analytics: A Primer

Using AI & ML for Application Performance (APM)

Cloud Log Management Strategy & Best Practices

Continuous Performance Management Basics

Synthetic Monitoring for CI/CD Pipelines

On-Premises Application Monitoring: An Introduction

Kubernetes Monitoring: An Introduction

Distributed Tracing Best Practices for Microservices

Monthly Archive

Follow Us