Operations | Monitoring | ITSM | DevOps | Cloud

CI/CD Observability Powered by OpenTelemetry

Modern engineering teams spend a lot of time and resources in setting up monitoring of their production systems - tracking uptime, catching errors, and responding to incidents before customers ever notice. But what about the journey before code reaches production? For most teams, observing the CI/CD pipeline is either an afterthought or completely overlooked. While we recognize its importance, do we truly understand how well our CI/CD process is functioning?

Monitoring your MCP Server in Production (with Sentry)

So you're building an MCP server for your project or service, to allow AI chatbots and agents to interact with it? Great! You've decided to build it using Cloudflare Workers, have written the code, shipped it, and the first users are getting onboard: you're officially running it in production. That's when problems start. I'm not here to dissuade you from shooting your shot, but let's make sure you've got your bases covered in production when something inevitably goes wrong.

Linux Security Logs: Complete Guide for DevOps and SysAdmins

Security logs are the quiet sentinels of your Linux systems, recording critical information that can mean the difference between detecting an intrusion and discovering a breach months too late. For most DevOps professionals and system administrators, these logs contain valuable insights that often go untapped. While they're essential for compliance, their real value lies in providing visibility into your system's security posture and operational health.

Using DCIM to Drive Down Data Center Energy Costs

Data centers are energy-intensive, and with the surge in AI-driven workloads, their global energy consumption is projected to more than double by 2030, potentially surpassing the current electricity consumption of Japan. For most data center operators, energy is one of their largest recurring expenses. As demand for data center capacity continues to grow and energy prices fluctuate, energy efficiency is no longer just a sustainability goal, it's a core business concern.

You, Me, and BugSplat's MCP

Let's face it - from an experienced developer's perspective, most software trends are, put lightly, incredibly annoying. The last thing a grizzled, old, technical wizard wants to hear is some half-brained junior developer telling them to switch their SQL server to MongoDB, replace the PHP EC2 with serverless Python, or rewrite their entire front-end with HTMX. The hype-train is so intense that even watching TV feels risky, as you might see something as absurd as an ad for AI toothpaste.

What's Holding Back AI Adoption in India?

Earlier this year, I spent a few weeks in India, visiting universities, speaking at meetups, and catching up with founders. What stood out wasn’t just the excitement about AI, but the focus on what it can actually do today. The curiosity about GenAI and big-picture questions around AGI is there, but most conversations centered around real needs: learning faster, applying for jobs, and getting healthier.

Prometheus vs Zabbix: A Hands-On Technical Comparison and a Modern Alternative

When choosing a monitoring tool, two popular names often come up, Prometheus and Zabbix. Both are powerful and widely adopted but come with different approaches and learning curves. Prometheus is favored in cloud-native environments for its time-series data model and flexibility, while Zabbix has long served traditional IT infrastructures with its rich agent-based monitoring. But what if you are looking for a simpler, more unified solution?

Your Observability Platform Has a Blind Spot: Don't Risk Your Operations on Bolt-on Incident Response Modules

Observability platforms want to do it all—from data collection to incident response. Their pitch is appealing: one platform to eliminate context switching and reduce overhead. But when critical systems fail—and they will fail—, add-on incident management modules won’t save you. You need an end-to-end system built specifically for high-stakes incident management.

Tracealyzer Was Just the Beginning

If you’ve been building embedded systems for a while, chances are you know Percepio for Tracealyzer. And we’re proud of that. For over a decade, Tracealyzer has been helping engineers visualize and solve complex RTOS issues faster, with over 30 ways to slice and understand system behavior. But in 2025, embedded systems demand more. They’re always on. Always connected. And increasingly, always business-critical.