Operations | Monitoring | ITSM | DevOps | Cloud

Flyway code analysis - These are a few of my favorite rules

Clean, consistent SQL code isn’t just a preference. It’s a pathway to healthier applications, faster debugging, and happier teams. Whether you’re onboarding new developers or optimizing legacy systems, having a clear set of standards can make all the difference. Flyway’s code analysis is a powerful ally in keeping your database code tidy and secure, and making sure best practices are being followed.

Risk Register for SREs: A Practical Guide to Proactive Incident Prevention

A risk register is one of the most powerful tools in an SRE's arsenal for maintaining system reliability. By systematically documenting potential threats to your infrastructure and services, you can shift from reactive firefighting to proactive risk management.

Set Up ClickHouse with Docker Compose

ClickHouse is built for high-performance OLAP workloads, capable of scanning billions of rows in seconds. If your analytical queries are bottlenecked on PostgreSQL or MySQL, or you're burning too much on Elasticsearch infrastructure, ClickHouse offers a faster and more cost-efficient alternative. This blog walks through setting up ClickHouse locally with Docker Compose and scaling toward a production-grade cluster with monitoring in place.

Stream AWS Metrics to Grafana with Last9 in 10 minutes

It’s 2:47 AM and your Lambda functions are timing out. API response times are spiking. You’re flipping between the CloudWatch console, your APM tool, and your logs, trying to figure out what’s going wrong. CloudWatch has the metrics you need: CPU usage, memory pressure, and request rates — but connecting that data to what your app is doing takes time. The delay in stitching it all together slows down your incident response.

I built an MCP Server for Observability. This is my Unhyped Take

Recently, I read a blog titled “It’s The End Of Observability As We Know It (And I Feel Fine)”, which discussed MCP servers in observability and how these systems would potentially be the “end of observability”. As someone who has spun up an MCP server for an observability backend and as someone who has been in the space for a while, I certainly do not think so.

Cloud or Self-Hosted - Which Deployment Model is Right For You?

Choosing the right observability platform is a critical decision. But how you deploy it is just as important. The right deployment strategy can accelerate your team, simplify operations, and ensure you meet compliance and security requirements. The wrong one can lead to operational headaches and slow you down. At SigNoz, we believe in flexibility. There is no single "best" way to deploy an observability platform; there's only the way that's best for you.

Apple WWDC25 Announcement of Enterprise IT Enhancements

At WWDC25, Apple announced a set of updates to simplify IT management for enterprises. These updates, spread across macOS 26, iOS 26, iPadOS 26, tvOS 26 and visionOS 26, introduce practical tools to improve device, application and user management. This article outlines the specific capabilities and how they can be applied effectively in enterprise environments.

Being on-call at incident.io

At incident.io, we are building a product that our users rely on 24/7, all year round. This means it is crucial that it is always working, and that is where our on-call rotation comes in. We believe that everyone should be on-call because it tightens the feedback loop between shipping new features and maintaining what we have, leading to more pragmatic engineering decisions.

How to monitor your Laravel app for critical vulnerabilities using Oh Dear

A critical security vulnerability was recently discovered in Livewire v3 that allows remote code execution, as Stephen Rees-Carter reported on Securing Laravel. While patches are released quickly, many applications remain vulnerable because developers simply don't know about the issue yet. Oh Dear's Application Health monitoring solves this by continuously checking your production environment for security vulnerabilities and immediately notifying you when issues are detected.