Operations | Monitoring | ITSM | DevOps | Cloud

Mastering NodeJS Performance Monitoring - A Practical Guide using Open Source Tools

Node.js powers some of the fastest-growing web applications, but its single-threaded nature makes it vulnerable to memory leaks and CPU spikes. To keep your app running smoothly, especially in production, you need more than just web server logs — you need complete visibility across the entire stack.

Top 15 Distributed Tracing Tools for Microservices in 2025

In one of our previous blogs, we discussed distributed tracing in depth. We examined why distributed tracing is critical and its components - spans and trace context. You can check the complete guide here: What is Distributed Tracing and How to Implement it with Open Source? Here, we'll look at some of the best distributed tracing tools. We'll see what each of them offers so that you can choose the right tool for your monitoring and observability requirements.

Top 13 Open Source APM Tools [2025 Guide]

Choosing the right APM tool is critical. How do you know which is the right one for you? Here are the top 13 open-source application performance monitoring(APM) tools that can solve your monitoring needs. Open-source APM tools have added benefits over their SaaS counterparts. They are more transparent, as you can verify their source code, and you can use them without going through the pains of obtaining approvals usually required for using a third-party vendor tool.

Top 5 Open Source Log Management Tools (and How to Choose the Right One)

Managing logs at scale is no longer just about storing text—it’s about gaining insights fast, keeping systems healthy, and troubleshooting in real time. With cloud-native architectures becoming the norm, the pressure is on for modern teams to adopt log management tools that are fast, scalable, and easy to use. But with so many options, how do you choose the right one?

Easy Method for Monitoring MinIO Performance Using Telegraf

MinIO is a high-performance, S3-compatible object storage server built for cloud-native applications. It’s open-source, lightweight, and incredibly fast which makes it a solution for developers who need to store and serve unstructured data like images, logs, or backups. Whether you’re building a self-hosted alternative to Amazon S3 or running MinIO as part of a local development pipeline, it fits into modern containerized environments.

Monitoring for Financial Services: Reducing Costs, Ensuring Reliability

Fintech has reshaped financial services, using technologies like machine learning and blockchain to deliver faster, smarter, more user-friendly experiences. Challenger banks, open banking apps, digital payments, and investment apps have set a new standard—leaving traditional institutions racing to keep up. But staying competitive isn’t just about building digital products—it’s about making them reliable.

Database observability: How OpenTelemetry semantic conventions improve consistency across signals

Databases are a crucial part of modern systems, which means database observability is incredibly important, too. However, gathering information on them can be complex, variable, and tricky to instrument in a consistent way. OpenTelemetry is helping to change that, and one of the most important aspects in making it work is a set of shared rules called semantic conventions.

Create rich, up-to-date visualizations of your AWS infrastructure with Cloudcraft in Datadog

As your cloud environment grows more complex and dynamic, it becomes more difficult to maintain up-to-date reference diagrams, visualizing its components, that are available to all teams. As a result, teams often end up lacking the visibility they need to understand, manage, and troubleshoot their cloud infrastructure and applications.

From Downtime to Uptime: Monitoring Tools and Techniques for Systems, Websites, APIs, and More

Recently, while visiting a friend in a local hospital, I found myself facing a frustrating distraction: trying to pay parking fees using USSD (a mobile text-based system for quick transactions). The service was either painfully slow or not working at all. I wasn’t alone. Other visitors were just as exasperated, and parking attendants stood idle, their handheld devices frozen in endless loading loops.

The Brain Behind the Pings: Understanding the Pingmesh Control Plane

In today’s interconnected world, a fundamental question plagues every network administrator and SRE: “Is my network running well?” The answer, often elusive, is precisely what Pingmesh aims to provide. By deploying a vast fleet of specialized probe agents, Pingmesh continuously monitors critical network health metrics, including latency, packet loss, jitter, and custom reachability checks, providing an unparalleled view into your network’s performance.