Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Observabilty for complex systems and related technologies.

Designing Honeycomb for Our Users

You might have noticed some visual changes happening in Honeycomb lately. Colors, typography, icons, and some features have started to look a bit different. While these changes are just beginning to make their way into the product, we’ve been working on them for some time. Let’s look at what has been going on behind the scenes to make them happen.

How to Use Observability to Reduce MTTR

When you’re operating a web application, the last thing you want to hear is “the site is down." Regardless of the reason, the fact that it is down is enough to cause anyone responsible for an app to break out into a sweat. As soon as you become aware of an issue, a clock starts ticking — literally, in some cases — to get the issue fixed. Minimizing this time between an issue occurring and its resolution is arguably the number one goal for any operations team.

Leverage Observability With OpenTelemetry to Understand Root Cause Quickly

An observability solution should help any incident responder understand what changed and why. A lot has been written on the difference between monitoring and observability, but an easy way to understand how both are integral to incident response is to consider how customers use PagerDuty—with both monitoring and observability tools—to get to the right answer.

o11ycon Keynote

presented at o11ycon+hnycon, June 9-10, 2021 Nora Jones, CEO @ Jeli, Charity Majors, CTO & Co-founder@ Honeycomb o11ycon Keynote Nora Jones and Charity Majors will share their experiences leading major movements shaping the future of shipping software. Nora Jones is CEO of Jeli, and former engineer at Netflix and Slack will share her research and experience with Chaos Engineering, human factors, and site reliability. Charity Majors is Honeycomb's CTO and co-founder, who pioneered Observability as a software practice for modern teams.

Performance analysis for supported modules with Honeycomb

The Infrastructure Automation Content (IAC) team noticed some supported modules tests were taking significantly more time than others. David Schmitt, Principal Software Engineer on the IAC team, explains how Puppet utilises Honeycomb to debug our supported modules for potential performance bottlenecks.

Module development failure analysis with Honeycomb

Writing modules for yourself is easy, but writing modules for other people to use? Not so much. Failures in modules can have major repercussions, and our IAC team in Puppet takes that very seriously. Listen as David Schmitt and Daniel Carabas walk you through how we utilise Honeycomb for failure analysis with Github Actions during module development.

OpenTelemetry, Not Just for Production Troubleshooting

OpenTelemetry, Not Just for Production Troubleshooting: How to Prevent Downtime as Early as Local Dev OpenTelemetry is a great tool for observability and debugging in production. It provides you with data that empowers understanding of what is slow or broken, as well as what you can do to fix problems that occur in production. But what if you could leverage those same OpenTelemetry capabilities in pre-production? What if you could use those capabilities during development and testing phases to proactively prevent downtime in production?