Latest Posts

The hard part of AI root cause analysis is no longer the model

Jun 30, 2026 By Nikolay Sivko In Coroot

Every few weeks someone tells me root cause analysis is a solved problem now: pipe your telemetry into an LLM, let it tell you what broke. I wish it were that easy. After years on this, I think "can AI do RCA?" is the wrong question, because doing RCA with an LLM is really two separate jobs, and the answer is different for each. They break in completely different ways, so it's worth pulling them apart.

Read Post

Coroot

Read more about The hard part of AI root cause analysis is no longer the model

Observability on Windows, before eBPF is production-ready

Jun 23, 2026 By Nikolay Sivko In Coroot

No large enterprise runs a single stack. A shiny new Kubernetes cluster sits right next to a Windows Server box that has quietly run the billing system for a decade without missing a beat. Both keep the business running. Both deserve the same visibility. Linux runs most server workloads, and Coroot grew up there. Our open-source node-agent uses eBPF to collect metrics, logs, traces, and profiles, with no code changes. But "most" is not "all".

Read Post

Coroot

Read more about Observability on Windows, before eBPF is production-ready

Zero-config Go heap profiling

Apr 27, 2026 By Nikolay Sivko In Coroot

Coroot's node-agent already collects CPU profiles for any process on the node using eBPF, with zero integration from the application side. For Java, we dynamically inject async-profiler into the JVM to get memory and lock profiles. But Go processes were still a blind spot for non-CPU profiling unless the app exposed a pprof endpoint and the cluster-agent scraped it. We wanted the same zero-config experience for Go heap profiles. This post is about how we got there.

Read Post

Coroot

Read more about Zero-config Go heap profiling

Profiling Java apps: breaking things to prove it works

Apr 3, 2026 By Nikolay Sivko In Coroot

Coroot already does eBPF-based CPU profiling for Java. It catches CPU hotspots well, but that's all it can do. Every time we looked at a GC pressure issue or a latency spike caused by lock contention, we could see something was wrong but not what. We wanted memory allocation and lock contention profiling. So we decided to add async-profiler support to coroot-node-agent. The goal: memory allocation and lock contention profiles for any HotSpot JVM, with zero code changes. Here's how we got there.

Read Post

Coroot

Read more about Profiling Java apps: breaking things to prove it works

Making encrypted Java traffic observable with eBPF

Mar 23, 2026 By Nikolay Sivko In Coroot

Coroot's node agent uses eBPF to capture network traffic at the kernel level. It hooks into syscalls like read and write, reads the first bytes of each payload, and detects the protocol: HTTP, MySQL, PostgreSQL, Redis, Kafka, and others. This works for any language and any framework without touching application code. For encrypted traffic, we attach eBPF uprobes to TLS library functions like SSL_write and SSL_read in OpenSSL, crypto/tls in Go, and rustls in Rust.

Read Post

Coroot

Read more about Making encrypted Java traffic observable with eBPF

Instrumenting Rust TLS with eBPF

Mar 17, 2026 By Nikolay Sivko In Coroot

Coroot is an open source observability tool that uses eBPF to collect telemetry directly from applications and infrastructure. One of the things it does is capture L7 traffic from TLS connections without any code changes, by hooking into TLS libraries and syscalls. Works great for OpenSSL. Works for Go. Then rustls enters the picture and everything stops being obvious. With OpenSSL, everything is nicely wrapped: From eBPF’s point of view this is perfect: Everything happens inside one call.

Read Post

Coroot

Read more about Instrumenting Rust TLS with eBPF

Let's make alerting great again

Feb 26, 2026 By Nikolay Sivko In Coroot

No one has time to watch dashboards all day. Alerts exist to tell us when something goes wrong or is starting to go wrong, so we can act early. In theory, it sounds simple. Define a rule, set a threshold, get notified when it is crossed. In practice, it rarely works that smoothly.

Read Post

Coroot

Read more about Let's make alerting great again

How to Reduce Your Cloud Costs with Coroot

Nov 26, 2025 By Alexander Lamberton In Coroot

Cloud costs often grow quietly until they suddenly command everyone’s attention. Gartner estimates that companies overspend on cloud services by up to 70 percent, mostly because they lack clear visibility into where the money is actually being spent. Cloud invoices speak the language of infrastructure: nodes, instance types, regions, volumes, and egress. Engineering teams speak the language of services, deployments, and code.

Read Post

Coroot

Read more about How to Reduce Your Cloud Costs with Coroot

Memory stall: the agony before OOM

Sep 23, 2025 By Nikolay Sivko In Coroot

When we set a memory limit for a container, the expectation is simple: if the app leaks memory, the OOM killer steps in, the container dies, Kubernetes restarts it, done. But reality is messier. As a container gets close to its memory limit, allocations don’t just fail instantly. They get slower. The kernel tries to reclaim memory inside the cgroup, and that takes time. Instead of being killed right away, your app just crawls.

Read Post

Coroot

Read more about Memory stall: the agony before OOM

Instrumenting the Node.js event loop with eBPF

Sep 19, 2025 By Nikolay Sivko In Coroot

Recently, I was testing Coroot’s AI Root Cause Analysis on failure scenarios from the OpenTelemetry demo. One of them, loadgeneratorFloodHomepage, simulates a flood of excessive requests. As expected, it caused a latency degradation across the stack. Coroot’s RCA highlighted how the latency cascaded through all dependent services. At the same time, we noticed a moderate increase in CPU usage for the frontend service and the node itself.

Read Post

Coroot

Read more about Instrumenting the Node.js event loop with eBPF

Operations | Monitoring | ITSM | DevOps | Cloud

The hard part of AI root cause analysis is no longer the model

Observability on Windows, before eBPF is production-ready

Zero-config Go heap profiling

Profiling Java apps: breaking things to prove it works

Making encrypted Java traffic observable with eBPF

Instrumenting Rust TLS with eBPF

Let's make alerting great again

How to Reduce Your Cloud Costs with Coroot

Memory stall: the agony before OOM

Instrumenting the Node.js event loop with eBPF

Monthly Archive

Follow Us