Operations | Monitoring | ITSM | DevOps | Cloud

Grafana Cloud updates: The latest features in Kubernetes Monitoring, Fleet Management, and more

We consistently roll out helpful updates and fun features in Grafana Cloud, our fully managed observability platform powered by the open source Grafana LGTM Stack ( Loki for logs, Grafana for visualization, Tempo for traces, and Mimir for metrics). In case you missed them, here’s our monthly round-up of the latest and greatest Grafana Cloud updates.

Trace Distributed Map states for AWS Step Functions with Datadog

AWS Step Functions offers the Distributed Map state, enabling you to coordinate massively parallel workloads within your serverless applications. With this feature, a single Step Functions execution can fan out into up to 10,000 parallel workflows simultaneously, making it possible to efficiently process millions of items in parallel. This capability unlocks new possibilities for large-scale data processing, such as image transformation, log ingestion, or batch analytics.

Now you can use Sentry Insights to trigger alerts and debug issues

You deploy a fix late Friday and spend the weekend refreshing dashboards, hoping nothing breaks. You shouldn’t have to babysit a dashboard to know when something’s wrong. With the latest updates to Insights, you can now create alerts directly from any chart. Whether it’s a spike in 4xx errors after a deploy, a jump in P95 latency for an API endpoint, or a drop in throughput for a background job, you can set up alerts with just two clicks.

How Sentry's Seer AI Agent passes legal review: a guide for legal teams reviewing Seer

If your legal department is anything like ours, you’re being inundated with requests from the business to use more and more AI tools. Whether it's developers wanting to use coding agents like Cursor, to security implementing AI-driven investigations, to sales and marketing leveraging AI for call insights and competitive research, we've seen a shift in what teams are trying and buying.

Agentic ITOps: The smarter alternative to outsourcing L1 operations

The complexity of modern enterprises has pushed IT operations to the limit. Hybrid cloud environments, CI/CD pipelines, microservices, and agile methodologies revolutionized IT, but caused an explosion of scale and data fragmentation. This complexity simply cannot be managed by legacy tools or manual ITSM processes designed for monolithic systems and static infrastructures.

How One MSP Used AI to Cut Noise by 78% and Reclaim Engineering Time

An operations team at one of the Asia-Pacific’s largest managed service providers (MSPs) was drowning in their own success. Years of investment in monitoring tools and automation had created comprehensive visibility—and comprehensive chaos. Engineers opened dashboards each morning to find thousands of alerts waiting, with critical incidents buried somewhere inside. The scale of the problem was overwhelming their capacity to respond effectively.

Security and Compliance Takes Center Stage: Key Insights from Open Source Finance Forum - London 2025

We’ve just wrapped up London’s 2025 Open Source Finance Forum (OSFF) in London and in this blog I’ll try to capture the key highlights from this year’s event while they’re still fresh. Dominant themes were the increasing prominence of legislation and governance frameworks, and what these mean for developers and practitioners.

See more, solve more with end-to-end network path tracing

Few things hold IT teams back more than a lack of visibility. It’s exponentially harder to solve issues when they originate in parts of the environment you can’t see. That’s one of the big limitations of native tools for monitoring and managing Microsoft Teams. Microsoft Call Quality Dashboard, Admin Center and Service Dashboard, and Meeting Room Pro Dashboard are all constrained to the aspects of Teams that Microsoft controls directly.

Why synthetic testing is the secret to proactive Teams management

The more organizations depend on collaboration solutions like Microsoft Teams for productivity, the more IT departments are expected to ensure a seamless experience every time. That demands more than just rapid troubleshooting when issues occur: it requires IT teams to get ahead of problems and keep them from affecting users in the first place. For that, synthetic testing is a must.