Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

A glimpse into the day-to-day life of a software monitoring expert

Working in the field of software monitoring may seem boring or too technical, but let me tell you that there is more fun and excitement than one might imagine at first. Not that we’re all day doing barbecues and celebrating, but once we almost did our very own Olympics in the office! Kind of like The Office, you know. *Long live Michael Scott. Anyway, join me on this journey for a day in the life of a software monitoring expert, where code lines mingle with laughter and soluble coffee.

What's new in distributed trace visualization in Grafana

At Grafana Labs, we are constantly improving our feature set, and tracing is no different. Traces are often overshadowed by logs and metrics, but they’re a pillar of observability for a reason. Used correctly, organizations that can quickly and successfully follow a chain of events through a system gain a more holistic view of their systems and are better equipped to find and fix issues faster.

Unraveling AWS Lambda: Exploring Scalability and Applicability

In our previous blog, we shared our firsthand experience of implementing a tracing collector API using serverless components. Drawing parallels with Amazon Prime Video’s architectural redesign, we discussed the challenges we encountered, such as cold-start delays and increased costs, which prompted us to transition to a non-serverless architecture for more efficient solutions.

IT Event Correlation: Software, Techniques and Benefits

IT event correlation is the process of analyzing IT infrastructure events and identifying relationships between them to detect problems and uncover their root cause. Using an event correlation tool can help organizations monitor their systems and applications more effectively while improving their uptime and performance.

When Third-Party Plugins Go Wild

Every single day RapidSpike detects thousands of problems with website third-party plugins that are causing revenue and customer experience issues, and 90% of them are not just affecting our users; they are affecting every user of that third party. The difference is with RapidSpike, we tell them about it. In 2018, a major e-commerce website experienced a significant performance failure due to a third-party plugin.

Microsoft Teams Monitoring to Troubleshoot & Optimize Performance

If there's one thing we know about successful teams, it's that they need top-notch communication to conquer the corporate jungle. That's where Microsoft Teams swoops in to save the day! As you probably already know, Teams is the ultimate collaboration playground for businesses, connecting people, and getting things done in a snap. But here's the thing: even the most powerful tools need a little TLC to stay in their prime. And that's where we come in!

Introducing the new Lumigo Live Tail

As developers, we understand the immense value of having real-time access to live traces. It significantly enhances our ability to identify, debug, and troubleshoot potential issues within applications, streamlining the development and deployment process. Today, we are excited to introduce the new and improved Live Tail feature at Lumigo, which enhances your observability experience to a whole other level.

Cut through the complexity of your public sector cloud migration

Learn how application performance monitoring can ease cloud migration challenges for public sector agencies. Cloud technology has come of age, and organizations across every industry are rapidly migrating their key applications to these flexible environments. Like their enterprise counterparts, public sector agencies are excited about the potential of cloud services.

The Road Ahead: 4 Ways AIOps Will Build More Resilient IT Operations

This article is the final installment in a 4-part series on leveraging artificial intelligence and machine learning (ML) for IT operations (AIOps) to provide a more efficient, reliable, agile, cost-effective, and optimized IT infrastructure. Just as our roads and highways evolve overtime to meet the demands of the travelers who use them, AIOps will continue to transform how organizations build, use, and manage their infrastructures.