Operations | Monitoring | ITSM | DevOps | Cloud

Pastries with SREs: Leveling up observability and donut dunkability

In this episode of Pastries with SREs, we explore what it really means to shift left with observability, moving from reactive firefighting to proactive performance. And yes, it starts with donuts. We unpack how SREs and IT Ops teams are often stuck reacting to incidents, battling alert fatigue and swivel-chair triaging. But what if you could pull in developers earlier, and give everyone a unified view of observability data?

Observability-as-Code: Bring synthetic monitoring into your pipeline

Your team just deployed to production. The infrastructure spun up in 90 seconds, but recreating your monitoring? That’ll take hours. It’s added late in the process, managed through dashboards, and prone to inconsistency. Short-term, this slows delivery and creates visibility gaps that surface only during incidents. Long-term, it leaves a business-critical capability out of your observability pipeline.

Signal Enrichment: Turning Noisy Alerts into Actionable Intelligence

This is the fourth post in our series on the future of incident management, which builds upon The Future of Incident Management: Your Blueprint for Operational Excellence, How Native Process Automation and Auto-Remediation Drive Operational Excellence, and Service Intelligence is the Future of Proactive Incident Management.

Bitbucket Dynamic Pipelines Creation and Deployment | Bitbucket Blitz | Atlassian

In this video, I introduce Bitbucket's Dynamic Pipelines. By watching this video, you'll learn how to create a Dynamic Pipeline using Atlassian's Forge tool and deploy it to your Bitbucket Cloud site. About Atlassian: Behind every great human achievement, there is a team. From medicine and space travel to disaster response and pizza deliveries, we help teams all over the planet advance humanity through the power of software. Our mission is to help unleash the potential of every team.

Automated RAG pipeline evaluation and benchmarking with RAGAS

Retrieval-Augmented Generation (RAG) pipelines have become an integral part of how Large Language Models (LLMs) access information beyond their training cutoff. These pipelines enable LLMs to deliver current, accurate, and grounded responses. By fetching relevant external documents, RAG mitigates common LLM challenges like factual inaccuracies and hallucinations. However, this methodology introduces a new complexity: evaluating RAG pipeline performance is particularly challenging.

The observability maturity curve: How IT leaders are shifting from tools to outcomes

Observability has come a long way from its origins in monitoring logs and metrics. Today, it sits on a maturity curve: Organizations move from fragmented tool stacks to unified platforms to proactive engineering practices that tie reliability to business outcomes. To better understand where IT leaders are on this curve, Grafana Labs surveyed 150 decision-makers across industries in advance of ObservabilityCON 2025.

How to automate sending SquaredUp dashboards to Slack with the Notification API

SquaredUp's existing notifications fire when monitors change state. With Notification API, you control the trigger. Send dashboards on a schedule, before meetings, or on-demand through chat commands. In this step-by-step guide, you’ll learn how to automate sending SquaredUp dashboards to Slack. I’ll use Power Automate as the example, but the same approach works with other automation tools such as Zapier, Make, n8n, or even a custom script, as long as it can send an HTTP request.

LLM Observability Explained: Prevent Hallucinations, Manage Drift, Control Costs

Large Language Models (LLMs) are transforming how businesses interact with users, automate workflows, and deliver insights in real time. But as powerful as these models are, running them at scale comes with unique challenges, from hallucinations and latency spikes to cost overruns and user trust issues.

How to Perform Ping Tests: Different Tools and Techniques

If you’re a remote worker struggling with video calls, or a gamer noticing lag, a quick Internet ping test using an online ping tester can give you a simple yes/no answer: Is my connection alive, and how fast does it respond?. But if you’re a network admin or IT professional, that’s just scratching the surface. Business networks are more complex beasts.