Operations | Monitoring | ITSM | DevOps | Cloud

Puppet Control Repository: Your Source of Truth for Infrastructure Management

Learn the fundamentals of Puppet's Control Repository with Margaret and Tony in this comprehensive walkthrough. See how Control Repos serve as your single source of truth for managing configuration across your entire infrastructure, driving collaboration and standardization while simplifying code deployments.

Behind Megaport's Network Automation Platform

We’ve teamed up with the Heavy Networking podcast to take you under the hood of Megaport’s resilient, software-driven network. Luke Gollan, Network Automation Engineer at Megaport, joins Heavy Networking hosts Ethan Banks and Drew Conry-Murray to unpack what happens when you click “provision” in the Megaport portal.

Behind the Dashboard: How to monitor your LLM integrations

Behind the Dashboard is an ongoing series where we look under the hood of a specific Catchpoint feature. Each episode breaks down the technology itself, what’s challenging about using it for monitoring, and how we removed friction and toil to make it a valuable part of the Catchpoint platform. In this episode Leon, Mursi, and Rahul take a look at Catchpoint’s LLM monitoring capabilities, including ensuring your integrated LLMs are up and performing optimally; as well as knowing if you’re using the most effective (accurate) and economical (cheapest per query) option in your suite.

Meet Canvas: Your AI-guided Workspace Within Honeycomb

Modern systems are wonderfully capable, but relentlessly complex. Debugging across microservices, frontends, and cloud edges often means switching between five or more tools, trying to stitch together “what changed” and “why it broke.” Honeycomb’s wide events model has proven to be a superpower for taming that complexity, by allowing you to easily observe and query end-to-end traces without worrying about how much granular data you attach to your events.

Breaking Free from SQLite - Why We Added PostgreSQL Support to SigNoz

"Let us support different relational databases apart from SQLite. Nobody likes to run SQLite in production." This was one of the most requested features from our community. Your requests have been heard, and we've added support for different relational databases, starting with PostgreSQL. If you're self-hosting SigNoz, you no longer need to worry about SQLite's limitations. Let's dive into what we've built and why it matters for your production deployments.

Debug, query, and build faster with AI: How we use Grafana Assistant at Grafana Labs

We recently released Grafana Assistant into public preview for Grafana Cloud, and we’ve been excited to see how our customers have already made it part of their daily observability routines. At the same time, Assistant is becoming a go-to companion for developers right here at Grafana Labs, whether they’re debugging on-call issues, helping customers, or trying to remember tricky PromQL syntax.

DevOps Guide to Monitoring in Serverless Applications

Serverless computing helps teams move faster by removing the need to manage servers. Code runs only when needed, scaling up or down automatically. For DevOps engineers, this means quicker deployments and less infrastructure work. But serverless also brings new challenges. Functions run for short periods, making it hard to track errors, performance, and costs.

FireHydrant 4-Minute Demo

Get a quick walkthrough of the FireHydrant platform. FireHydrant is the all-in-one incident management platform that helps teams resolve incidents up to 90% faster — and prevent them from happening again. From flexible alerting and powerful automation to retros and AI insights, it brings clarity and control to every step of your response.

Pastries with SREs: Limitless observability and uncompromised donuts

In this episode of Pastries with SREs, we dig into Limitless Observability with a sweet side of unified observability strategy. If you're tired of siloed tools, fractured data, and swivel-chair investigations, this one’s for you. We explore: Why are silos still the norm in modern observability? What’s the true cost of inefficiencies across logs, metrics, and traces? How can SREs, IT operations, and dev teams shift to a no-compromise, unified observability model?

Diskless 2.0: Unified, Zero-Copy Apache Kafka

We’ve added Tiered Storage to Diskless Kafka—using plain old KIP-405 as the read-optimizer, Diskless Kafka materializes fast-to-read segments—unifying Tiered and Diskless into a single path. This leverages production-grade Tiered Storage plugin, removes the need for bespoke components, and simplifies the community discussion. We’ve also upgraded KIP-1150 and KIP-1163 to address the community’s most pressing questions such as transactions and queues support.