Honeycomb

OpenTelemetry Best Practices #2: Agents, Sidecars, Collectors, Coded Instrumentation

For years, we’ve been installing what vendors call “agents” that reach into our applications and pull out useful telemetry. From monitoring agents to full-blown APM tools, this has been the standard approach. With OpenTelemetry, though, the term “agent” isn’t used as much, and in most scenarios it means something slightly different.

Focused Labs & Honeycomb: Better Together

We're excited to unveil a new collaboration with Focused Labs, a leap forward in our shared commitment to advancing modern observability practices and enhancing the robustness of legacy systems. This partnership is not just about scaling our service offerings but also about integrating Focused Labs' deep engineering expertise with our observability platform to deliver unparalleled customer experiences.

OpenTelemetry Best Practices #1: Naming

Naming things, and specifically consistently naming things, is still one of the most useful pieces of work you can do in telemetry. It’s often overlooked as something that will just happen naturally and won’t cause too much of an issue—but it doesn’t happen naturally, it does cause issues, and you end up having to fix the data in pipelines or your backend tool.

Driving Culture Change: Phorest's Observability Transformation

Phorest wanted a tool to help foster a culture of observability among the engineers at an affordable and predictable price. With their application stack hosted on AWS, Phorest delivers a premier software solution that empowers their salon and spa business customers to thrive. Ensuring every engineer has access to an observability tool is integral to the company's success model, enabling them to deliver great code for their designated software services.

Negotiating Priorities Around Incident Investigations

There are countless challenges around incident investigations and reports. Aside from sensitive situations revolving around blame and corrections, tricky problems come up when having discussions with multiple stakeholders. The problems I’ll explore in this blog—from the SRE perspective—are about time pressures (when to ship the investigation) and the type of report people expect.

Much Ado About OpenTelemetry

OpenTelemetry has done so much good work in the software industry over the last five years, specifically in the domain of observability. Bringing users and vendors together to define the future of telemetry? Check! Unifying logs, traces, and metrics under a completely vendor-neutral API? Check! Deprecating other standards by bringing their collaborators to the table to ensure their use cases are met? CHECK!

APM From a Developer's Perspective

In twenty years of software development, I did not have the privilege of being on call, of tending to my software in production. I’ve never understood what “APM” means. Anybody can tell me what it stands for—Application Performance Monitoring (or sometimes, the M means Management)—but what does it mean? What do people use APM for?

Flight to Success: Birdie's DevOps Evolution Fueled by Observability Insights

Birdie wanted to move its observability to a platform that would provide meaningful insights for application performance and debugging. Ensuring customers can provide seamless and timely care to in-home patients is a top priority for Birdie, and the development team takes pride in building and maintaining a high-quality platform distinguished by its reliability and responsiveness.

Three Properties of Data to Make LLMs Awesome

This post first appeared on Phillip's personal blog. Back in May 2023, I helped launch my first bona fide feature that uses LLMs in production. It was difficult in lots of different ways, but one thing I didn’t elaborate on in several blog posts was how lucky I was to have a coherent way to get the data I needed to make the feature useful for users.