Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

GrafanaCONline 2021: Your guide to the newest announcements from Grafana Labs

In addition to all the great talks from community members about their use cases, GrafanaCONline 2021 will include a number of sessions with the Grafana team about the latest features and use cases for Grafana. Throughout the week, we’ll continue to unveil new features, go deeper with live demos, and share our plans about the future of Grafana.

Unified Observability: A Business-Centric View

Here at LogicMonitor, we’re on a mission to build the most comprehensive, extensible, and intelligent monitoring and observability platform in the world to help businesses run seamlessly. We’ve spent more than a decade building a best-in-class monitoring platform. Over the past two years, however, we have further evolved our platform to deliver invaluable end-to-end observability across applications, networks, and infrastructure for companies of all sizes and in a variety of industries.

Better Alerts [as in, far more specific and just generally way better]

A couple of weeks back, we broke sign-ups. And in the most meta fashion, we learned about this because someone here had the foresight to set up an alert in Sentry to notify us if sign-ups dropped to zero. Getting alerted kicked off our incident response process. A team was formed to tackle “What broke?”, “How do we fix this?”, “How long has this been happening?”, “Are any other services impacted?”, and much more.

Rollbar Academy: Rollbar Analytics

This session focuses on revealing the operational data that is available for analysis within your Rollbar account and how to utilize it to better understand and improve your development processes. Learn how to take advantage of features like People tracking and RQL to explore error data in-depth and how to further automate these steps using the Rollbar REST API.

Incident Review - Fastly Outage Impacts Major Websites Worldwide

On June 8, 2021, many of us were left staring at blank screens or “Service Unavailable” errors when trying to access the internet. The panic was shared by millions of people around the world. Everything from Spotify, Amazon, and Reddit to Vimeo, Twitch, and Pinterest was inaccessible to users. This major outage that impacted any service using Fastly. Here is a quick rundown of what happened and why.

Dashbird app launches new version

The new Dashbird app is bringing your data together for a faster, more secure, and smoother observability experience with team collaboration in mind. The enhanced version of the Dashbird app is making your account more secure and your app navigation and data exploration faster, more intuitive, and all-around enjoyable. Additionally, you can now enable multi-factor authentication (MFA) for your Dashbird account. Check it out now!

Monitoring Kafka Performance with Splunk

Today’s business is powered by data. Success in the digital world depends on how quickly data can be collected, analyzed and acted upon. The faster the speed of data-driven insights, the more agile and responsive a business can become. Apache Kafka has emerged as a popular open-source stream-processing solution for collecting, storing, processing and analyzing data at scale.

Collecting Kafka Performance Metrics with OpenTelemetry

In a previous blog post, "Monitoring Kafka Performance with Splunk," we discussed key performance metrics to monitor different components in Kafka. This blog is focused on how to collect and monitor Kafka performance metrics with Splunk Infrastructure Monitoring using OpenTelemetry, a vendor-neutral and open framework to export telemetry data. In this step-by-step getting-started blog, we will.