With OpenTelemetry, ComplyAdvantage overhauled its observability (twice)

With OpenTelemetry, ComplyAdvantage overhauled its observability (twice)

Jan 3, 2024

ComplyAdvantage, which provides compliance and risk management tools, has overhauled its observability platform twice in two years, first moving from on-prem Grafana OSS to Datadog, and then migrating from Datadog to Grafana Cloud. Join Principal SRE Adam Wilson to hear how his team’s approach to observability evolved, and how their increased OTel usage made it possible to migrate twice — and to get the most out of Grafana Cloud for metrics, logs, traces, Kubernetes monitoring, and more.

Chapters

0:00 Introduction

2:49 Why Migrate? (Inspired by Google SRE Book)

3:45 ComplyAdvantage Infrastructure

7:25 OpenTelemetry Has Entered the Chat

10:31 Distributed Tracing

11:35 Observability Infrastructure

14:52 Sampling - do you really need all that data?

16:44 Timeline of the first migration

17:39 Why we decided to migrate a second time

19:00 The second migration, this time with Grafana

21:09 Telling stories with data

26:00 Timeline of second migration

26:18 Lessons learned and reflection

#observability
#migration
#grafana
#infrastructure