Operations | Monitoring | ITSM | DevOps | Cloud

%term

Calico eBPF Source IP Preservation: The Unexpected Story of High Tail Latency

The Calico eBPF data plane is your choice if latency is your primary concern. It was very disturbing that some benchmarking brought to our attention that eBPF had higher tail latency than iptables. The 99+% percentiles were higher by as much as a few hundred milliseconds. We did a whole bunch of experiments and we could not crack the nut until we observed that there are some occasional and unexpected TCP reset (RST) packets, but no connections were reset.

The critical role of Kafka monitoring in managing big data streams

Apache Kafka is the backbone of modern data streaming architectures, enabling real-time data movement, stream processing, and event-driven applications at scale. It enables high-throughput messaging between data sources and analytics platforms, supports log aggregation, and facilitates scalable extract, transform, load (ETL) pipelines for continuous data transformation and storage.

5 strategies to reduce false alerts in server monitoring

There are two types of alerts you don't want: We call these false alerts. As a person with responsibility over your IT infrastructure, it is natural that you have configured your monitoring systems to alert you at every step. But when these false alerts take up too much of your time, one of these unfortunate scenarios may occur: Let's explore more about false alerts before we dive into five strategies to avoid them.

Accelerate Network Incident Response With AppNeta, Automic Automation, and ConnectALL

Enabling accurate exchange of information between key applications has become crucial in today’s hybrid and complex IT operations. When we speak with potential customers, one common question we hear is, “How easy is it to consume and integrate the insights generated by Network Observability by Broadcom?” This might sound like table stakes, but it is often a challenge due to siloed teams, the high levels of expertise required, different data formats, and time-consuming processes.

Get to Know JFrog ML

AI/ML development is getting a lot of attention as organizations rush to bring AI services into their business applications. While emerging MLOps practices are designed to make developing AI applications easier, the complexity and fragmentation of available MLOps tools often complicates the work of Data Scientists and ML Engineers, and lessens trust in what’s being delivered.

An Introduction to Absinthe for Elixir Monitoring with AppSignal

Absinthe is a popular GraphQL toolkit for building robust APIs in Elixir. Monitoring such APIs is essential to ensure performance, detect bottlenecks, and handle errors effectively. AppSignal offers a seamless way to monitor and gain insights into your Absinthe-powered GraphQL APIs, enabling you to keep applications performant and reliable.

Python Logging Exceptions: The Setup Guide You Actually Need

Debugging a Python app can be frustrating, especially when an unexpected crash leaves behind nothing but a vague error message. A well-configured exception log can make all the difference, turning guesswork into clear insights. Here’s how to set up logging that actually helps.