Coroot

https://coroot.com/

Palo Alto, CA, USA

2021

The hard part of AI root cause analysis is no longer the model

Jun 30, 2026 | By Nikolay Sivko

Every few weeks someone tells me root cause analysis is a solved problem now: pipe your telemetry into an LLM, let it tell you what broke. I wish it were that easy. After years on this, I think "can AI do RCA?" is the wrong question, because doing RCA with an LLM is really two separate jobs, and the answer is different for each. They break in completely different ways, so it's worth pulling them apart.

Read Post

Observability on Windows, before eBPF is production-ready

Jun 23, 2026 | By Nikolay Sivko

No large enterprise runs a single stack. A shiny new Kubernetes cluster sits right next to a Windows Server box that has quietly run the billing system for a decade without missing a beat. Both keep the business running. Both deserve the same visibility. Linux runs most server workloads, and Coroot grew up there. Our open-source node-agent uses eBPF to collect metrics, logs, traces, and profiles, with no code changes. But "most" is not "all".

Read Post

Zero-config Go heap profiling

Apr 27, 2026 | By Nikolay Sivko

Coroot's node-agent already collects CPU profiles for any process on the node using eBPF, with zero integration from the application side. For Java, we dynamically inject async-profiler into the JVM to get memory and lock profiles. But Go processes were still a blind spot for non-CPU profiling unless the app exposed a pprof endpoint and the cluster-agent scraped it. We wanted the same zero-config experience for Go heap profiles. This post is about how we got there.

Read Post

Profiling Java apps: breaking things to prove it works

Apr 3, 2026 | By Nikolay Sivko

Coroot already does eBPF-based CPU profiling for Java. It catches CPU hotspots well, but that's all it can do. Every time we looked at a GC pressure issue or a latency spike caused by lock contention, we could see something was wrong but not what. We wanted memory allocation and lock contention profiling. So we decided to add async-profiler support to coroot-node-agent. The goal: memory allocation and lock contention profiles for any HotSpot JVM, with zero code changes. Here's how we got there.

Read Post

Making encrypted Java traffic observable with eBPF

Mar 23, 2026 | By Nikolay Sivko

Coroot's node agent uses eBPF to capture network traffic at the kernel level. It hooks into syscalls like read and write, reads the first bytes of each payload, and detects the protocol: HTTP, MySQL, PostgreSQL, Redis, Kafka, and others. This works for any language and any framework without touching application code. For encrypted traffic, we attach eBPF uprobes to TLS library functions like SSL_write and SSL_read in OpenSSL, crypto/tls in Go, and rustls in Rust.

Read Post

Instrumenting Rust TLS with eBPF

Mar 17, 2026 | By Nikolay Sivko

Coroot is an open source observability tool that uses eBPF to collect telemetry directly from applications and infrastructure. One of the things it does is capture L7 traffic from TLS connections without any code changes, by hooking into TLS libraries and syscalls. Works great for OpenSSL. Works for Go. Then rustls enters the picture and everything stops being obvious. With OpenSSL, everything is nicely wrapped: From eBPF’s point of view this is perfect: Everything happens inside one call.

Read Post

Let's make alerting great again

Feb 26, 2026 | By Nikolay Sivko

No one has time to watch dashboards all day. Alerts exist to tell us when something goes wrong or is starting to go wrong, so we can act early. In theory, it sounds simple. Define a rule, set a threshold, get notified when it is crossed. In practice, it rarely works that smoothly.

Read Post

How to Reduce Your Cloud Costs with Coroot

Nov 26, 2025 | By Alexander Lamberton

Cloud costs often grow quietly until they suddenly command everyone’s attention. Gartner estimates that companies overspend on cloud services by up to 70 percent, mostly because they lack clear visibility into where the money is actually being spent. Cloud invoices speak the language of infrastructure: nodes, instance types, regions, volumes, and egress. Engineering teams speak the language of services, deployments, and code.

Read Post

Memory stall: the agony before OOM

Sep 23, 2025 | By Nikolay Sivko

When we set a memory limit for a container, the expectation is simple: if the app leaks memory, the OOM killer steps in, the container dies, Kubernetes restarts it, done. But reality is messier. As a container gets close to its memory limit, allocations don’t just fail instantly. They get slower. The kernel tries to reclaim memory inside the cgroup, and that takes time. Instead of being killed right away, your app just crawls.

Read Post

Instrumenting the Node.js event loop with eBPF

Sep 19, 2025 | By Nikolay Sivko

Recently, I was testing Coroot’s AI Root Cause Analysis on failure scenarios from the OpenTelemetry demo. One of them, loadgeneratorFloodHomepage, simulates a flood of excessive requests. As expected, it caused a latency degradation across the stack. Coroot’s RCA highlighted how the latency cascaded through all dependent services. At the same time, we noticed a moderate increase in CPU usage for the frontend service and the node itself.

Read Post

How eBPF Improves Open Source Observability

Feb 6, 2026 | By Coroot

Try it open source on your system. Learn how tools can make gathering and making sense of observability data instant and painless with co-founder Peter Zaitsev.

View Video

How to Fix DNS Problems

Jan 30, 2026 | By Coroot

All the problems that could go wrong with DNS and why "It's always a Freaking DNS Issue" according to DevOps movement co-founder and open source advocate Kris Buytaert.

View Video

What is DevOps?

Jan 29, 2026 | By Coroot

Learn what DevOps is from a founder of the movement, Co-founder of DevOpsDays, O11y, and Inuits, FOSS advocate: Kris Buytaert.

View Video

What Are the Pilllars of Observability?

Jan 16, 2026 | By Coroot

Understand the four pillars of observability (metrics, logs, traces, and profiles) with Co-founder Peter Zaitsev.

View Video

Observability Beyond Kubernetes: eBPF Magic

Jan 6, 2026 | By Coroot

Alex chats with Kris Buytaert, Co-founder of DevOpsDays, O11y, Inuits and pivotal instigator of the movement about why he loves using Coroot.

View Video

Observability vs. Monitoring

Jan 2, 2026 | By Coroot

Co-founder of DevOps Days, O11y, Inuits, and pivotal instigator of the DevOps movement Kris Buytaert explains why “observability best practices” starts with functioning monitoring and common mistakes to avoid.

View Video

EP #3: Cloud, Kubernetes, and the Evolution of DevOps - The Open Source Observability Podcast

Jan 2, 2026 | By Coroot

Kris Buytaert is the Co-founder of Inuits, O11y, and ‘DevOps Days,’ an internationally-attended series of DevOps events. He is a passionate advocate of Free and Open Source Software, and is accredited by the community as being a founding instigator of the DevOps movement. In this episode we trace the history of the DevOps movement from its intersection with open source and Agile, through the evolution of Cloud technologies and tools such Docker and Kubernetes, to present day best practices for CI/CD, monitoring, and observability.

View Video

DevOps AI Tools: Root Cause Analysis + eBPF + Clickhouse

Dec 16, 2025 | By Coroot

Watch Coroot’s Root Cause Analysis AI pinpoint the exact cause of an incident and suggest fixes in seconds.

View Video

Improve Your Observability With This CPU Metric

Dec 9, 2025 | By Coroot

🐧🐝 Learn what classic CPU metrics are (Load average, Node usage, and Container CPU usage) and why Delay Accounting can provide better, kernel-level insights into your system: https://t.ly/HQrWx

View Video

Faster, Simpler Root Cause Analysis with AI

Dec 8, 2025 | By Coroot

Incidents can quickly become costly, and digging through overwhelming amounts of telemetry can take hours. AI-Powered Root Cause Analysis automatically identifies the root cause of an incident and suggests fixes in seconds, so your team can get back to development (or if they’re on call at 3am, back to sleep.)

View Video

Monthly Archive

Follow Us