Operations | Monitoring | ITSM | DevOps | Cloud

Grafana Tempo 1.2 released: New features make monitoring traces 2x more efficient

Grafana Tempo 1.2 has been released! Among other things, we are proud to present both our first version to support search and the most performant version of Tempo ever released. There are also some minor breaking changes so make sure to check those out below. If you want ALL the details you can always check out the v1.2 changelog, but if that’s too much, this post will cover all the big ticket items.

Playbooks in Action: Creating Effective, Repeatable Incident Resolution Workflows

While service incidents can be wildly dissimilar, they tend to have one thing in common: a need for quick resolution. Response teams need a robust, repeatable process to follow that ensures fast, mistake-free execution, especially for those 4 AM calls. Having a documented checklist saved where the entire team can access and use it at any time could make the difference between quick resolution or compounding the problem.

Enabling SRE best practices: new contextual traces in Cloud Logging

The need for relevant and contextual telemetry data to support online services has grown in the last decade as businesses undergo digital transformation. These data are typically the difference between proactively remediating application performance issues or costly service downtime. Distributed tracing is a key capability for improving application performance and reliability, as noted in SRE best practices.

Network AF, Episode 5: Building relationships as an internet analyst with Doug Madory

Network AF welcomes Doug Madory to the podcast. Doug is a veteran, a researcher, a writer and Kentik’s director of internet analysis. With his start in the U.S. Air Force within its Information War Center, Doug has now been working in the networking industry for 12 years. After the Air Force, Doug went on to work for Renesys, which was acquired by Dyn, which was later acquired by Oracle.

Icinga Customer Story: Deutsche Telekom IT

We are proud of our many customers and users around the globe that trust Icinga for critical IT infrastructure monitoring. That´s why we’re now showcasing some of these enterprises with their Success stories. It´s stories from companies or organizations just like yours, of any size and different kinds of industries. Some of them are our long-standing customers, others have just recently profited from migrating from another solution to Icinga.

Epsagon-to-Lumigo: a step-by-step migration guide

At Lumigo. we believe in serverless technology, and our mission is to make serverless development easy and fast. For the past few months, we’ve been extending our observability and debugging capabilities, making it a breeze for developers to understand the end-to-end story of every request that goes through the system, find the root causes of issues and be able to easily address them.

New Tech Leader Survey Reveals Why the Time for Real-Time Operations is Now

“Customer obsessed.” “Customer-centric.” “Customer-first.” For CEO’s everywhere, setting and maintaining a coordinated focus on the customer has become a top priority when driving innovation. After all, for many organizations regardless of industry, digital customer experiences are what can make or break the bottom line.

3 Improvements Finance Teams Can Make To Their FP&A Process

FP&A is a strategic part of the finance organization and has the potential to drive important business outcomes. When done right, it can have a major positive impact on the future of the business. When done poorly, it can slow a company down. The role of FP&A has evolved. Today it isn’t just about taking inputs and crunching numbers — it’s about being a strategic advisor to the organization.

Incident Resolution: Do You Remember, the Twenty Fires of September?

From September to early October, Honeycomb declared five public incidents. Internally, the whole month was part of a broader operational burden, where over 20 different issues interrupted normal work. A fraction of them had noticeable public impact, but most of the operational work was invisible. Because we’re all about helping everyone learn from our experiences, we decided to share the behind-the-scenes look of what happened.