Splunk

How To Prepare for a Site Reliability Engineer (SRE) Interview

Site reliability engineering continues to gain traction in software development and IT. SRE is at the crossroads of software development and IT operations. In Ben Treynor’s words, SRE is “what happens when you ask a software engineer to design an operations function.” Site reliability engineering is a way for developers to actively build services and functions to improve the resilience of people, processes and technical systems.

What's A Sysadmin? The System Administrator Role Explained

For decades, system administrators have worked largely in the shadows to maintain the accessibility and uptime of your most important IT services. And, while the rise of DevOps and cloud computing has led to more people with a hybrid sysadmin/developer skillset, the primary duties of a system administrator will always be required.

An Overview of Microsoft Azure Services

Azure is the public cloud computing platform of Microsoft. It comprises more than 600 cloud services and supports varied operating systems, databases and developer tools. And the best part (in our opinion)? Splunk On-Call integrates with Microsoft Azure to help on-call teams improve incident response for Azure-based environments.

Using the Density Function for Adaptive Thresholding with Splunk

It’s 3PM on a Friday, and your day is winding down. Suddenly, you get an urgent email from your boss asking you to set up an alert for monitoring volume. You consider this an easy task. You set a hard threshold for what you think is a low volume based on the last four hours of incoming data.

Time Series Forecasting Use Cases and Anomaly Detection

Wouldn’t it be great to peek into the future and find answers to the problems that you’re facing today? This may sound like science fiction, but many companies currently possess this capability, and they are creating strategies around it to strengthen their monitoring and analytical capabilities. One way is time series forecasting, a statistical method. You can take advantage of the insights of time series forecasting by using techniques like anomaly detection.

Log Management: A Useful Introduction

We find ourselves submerged in a sea of software applications practically all the time. Their primary job is to make life easier and help us accomplish certain tasks. However, these applications require a lot of data. What’s more, their development requires a systematic approach with proper management of that data — and its related activities. But that’s not a straightforward and simple process. What happens if these applications stop running?

Adding RUM to Your ITSI Cocktail: Content Pack for Splunk Observability V2

Want to improve your outlook with a splash of RUM? In our pursuit of connecting users to the right data at the right time, we’ve come to see Real User Monitoring as an invaluable tool for understanding the total picture when it comes to your web properties, apps, and cloud footprint. Do you find yourself asking any of these questions?

Dashboard Studio: More Maps & More Interactivity

In Splunk Cloud Platform 8.2.2203, we're continuing to expand on interactivity capabilities and visualizations for Dashboard Studio. We've added the ability to use search results and job metadata as tokens, and pass tokens through drilldowns to other dashboards. There is a new map visualization for cluster maps and a UI for matching strings for dynamic coloring. And finally, we've included the ability to set a Studio dashboard as your home dashboard.

Tech Talk: DevOps Edition - Monitor and Alert on Your Kubernetes Clusters in Seconds

Watch Monitor and Alert on Your Kubernetes Clusters in Seconds to learn how Splunk Observability can help demystify challenges with monitoring distributed microservices. You’ll also view a demonstration of how to correlate application and infrastructure behavior to streamline troubleshooting and alerting on-premises and in the cloud.