Operations | Monitoring | ITSM | DevOps | Cloud

Contextual, in-product guidance for every Grafana user: A closer look at Interactive Learning

As developer advocates at Grafana Labs, we’re always looking for new ways to help our users better understand and learn observability. You might remember our previous project that brought learning to life through an adventure-style game, and now we’re really excited to share something else we’ve been working on: Interactive Learning, a new way to get the technical help you need directly in Grafana.

New Feature: Filter HTTP Pings by Keywords

Healthchecks.io can now classify HTTP pings from clients as start, success, or failure signals not only by URL suffixes (no suffix, /start, /fail, /{exit-status}) but also by looking for specific keywords or phrases in the HTTP request body. The content filtering feature was already available for email pings, and now it has been extended to HTTP pings as well.

Bitbucket's new look: user experience and navigation updates coming soon

We’re giving Bitbucket a fresh new look and more streamlined navigation as part of Atlassian’s broader visual system journey. Teams and workflows have improved, and Bitbucket is changing with them. Our goal is to make it faster to find your work, clearer to understand what’s happening, and more enjoyable to use every day—without disrupting what you already know and love. This update aligns Bitbucket with Atlassian’s modern, unified design, and will launch in early 2026.

Managing cloud infrastructure with AI assistant and Upsun MCP server

Artificial intelligence is changing the way we execute our everyday operations. AI assistants are incredibly intelligent; they can write code, explain complex concepts, and answer any question you throw at them. However, they can't execute actions on their own. If you ask your AI assistant to “create a backup of my database,” it may provide you with clear instructions, run the CLI commands directly or in some cases, even trigger actions through connected agent workflows.

Mastering AI Spend With CloudZero And LiteLLM

The AI landscape today feels a lot like the early days of the cloud: exciting, fast-moving, and completely fragmented. Every week, engineering teams are experimenting with dozens of large language models (LLMs) from providers like OpenAI, Anthropic, Google, Mistral, Meta, and beyond. They’re tweaking prompts, testing model performance, swapping context windows, and even running multiple models in parallel to figure out which one works best for each unique use case.

Patterns for Deploying OpenTelemetry Collector at Scale

So, you've embraced OpenTelemetry, and it's been great. Pat, Pat. That single, vendor-neutral pipeline for your traces, metrics, and logs felt like the future. But now, the future is getting bigger. That simple OTel Collector configuration that worked perfectly for a few services is starting to show its limits as you scale. The data volume is climbing, reliability is becoming a concern, and you're wondering if that single collector instance is now a bottleneck waiting to happen.

5 Steps To A Thriving Business

Creating a thriving business is one of those things that all sound so simple when you say it out loud, but in practice, it can be a wild jumble of clarity, doubt, small wins, and that little 'what am I even doing?' moment. And I've found that growth doesn't always feel like growing as it happens. Sometimes it registers as a softer kind of chaos that unfolds gradually. Even so, there are steps that do tend to nudge things in the right direction, even if you follow them a bit imperfectly.