Operations | Monitoring | ITSM | DevOps | Cloud

Benefits of using AIOps in ITSM

“Necessity is the mother of invention,” so here is a quick backstory to understand what brought AIOps into the ITSM landscape In the fast-paced world of Information Technology Service Management (ITSM), staying ahead of challenges and effectively managing complex systems is crucial. As organizations embrace digital transformation and adopt cutting-edge technologies, the volume of data and incidents generated becomes overwhelming for IT teams to handle manually.

What are Traceroutes and How Do Traceroutes Work?

If you've ever wondered why your Internet connection seems slow or experiencing connection problems with a website, you might have heard of a tool called "traceroute." But what is a traceroute, and how does it work? In this article, we'll be giving a quick and simple introduction to what are traceroutes, and how traceroutes work to help identify and troubleshoot network problems.

3 Ways to Lower Costs and Improve Efficiency This Year and Every Year

The second half of 2023 is officially in full swing, and with that comes everyone’s favorite topic of conversation; end of year fiscal targets and annual budget reviews. For IT teams, the perennial ask will come down from above…. “we need to find $X, what can we cut, where can we find efficiencies and how much can your department save?”. You need to figure out how to save money and improve efficiency – and you don’t have much time to do it.

What is Garbage Collection in Java: Detailed Guide

The Garbage Collection (GC) feature in the Java Virtual Machine (JVM) is truly remarkable. It automatically identifies and cleans up unused Java objects without burdening developers with manual allocation and deallocation of memory. As an SRE or Java Administrator you need a strong understanding of the Java Garbage Collection mechanism to ensure optimal performance and stability of your Java applications.

Using UX and Observability to Track Application Health

UX (user experience) is a core factor that determines the success of an application or platform in a distributed system. Specifically, developers need to understand the infrastructure within an entire application stack to improve and refine the user experience to meet customer expectations without guesswork. System downtime remains a significant source of revenue and reputational losses for enterprises, employees, and customers.

Automatic log level detection reduces your cognitive load to identify anomalies at 3 am

Let’s face it, when that alert goes off at 2:58am, abruptly shaking you out of a deep slumber because of a high-priority issue hitting the application, you’re not 100% “on”. You need to shake the fog out of your head to focus on the urgent task of fixing the problem. This is where having the best log analytics tool can take on some of that cognitive load. Sumo Logic recently released new features specific to our Log Search queries that automatically detect log levels.

Anything But Tech Debt

Tech debt is usually one of the most fraught topics on engineering teams. Engineers often feel they aren’t allowed enough time to address tech debt. Product partners wonder why engineers spend so much time working on it—or at least talking about it. “The business” always seems to insinuate that engineers should do less of it, instead focusing on shipping value to customers.

7 OpenTelemetry Metrics to Track for Better Visibility

In today’s rapidly evolving software landscape, ensuring observability is crucial for building robust and reliable applications. One of the critical components of observability is metrics, which provide valuable insights into the performance and behavior of our systems. OpenTelemetry, an open-source observability framework, offers a standardized approach to capturing, exporting, and analyzing metrics. This blog post explores seven OpenTelemetry metrics for tracking better visibility.