Operations | Monitoring | ITSM | DevOps | Cloud

Grafana Cloud updates: redesigned dashboard filters, more ways to use RBAC, and more

We consistently roll out helpful updates and fun features in Grafana Cloud, our fully managed observability platform powered by the open source Grafana LGTM Stack (Loki for logs, Grafana for visualization, Tempo for traces, and Mimir for metrics). In case you missed them, here’s our monthly roundup of the latest and greatest Grafana Cloud updates. You can also read about all the features we add to Grafana Cloud in our What’s New in Grafana Cloud documentation.

Avoid Rate Limiting with Query Batching

This post is part of our debugging series, where we share tricky challenges and solutions while building Sentry. On March 4th, 2024, the most metal incident happened - INC-666 INC-666, in a nutshell, was where the issue alert rule post-processing step was flooded with more load than it could handle, and alerts that were supposed to have fired did not. This means that Sentry customers might not be receiving alerts if the query that would have triggered the alert is rate-limited.

Use the Telegraf Exec Plugin to Convert Data Formats

Converting multiple data formats into one unified format makes software and DevOps monitoring so much easier, as it brings together all types of metrics for a smoother, more consistent analysis. This approach cuts down on the need for separate parsing setups, saving time and reducing complexity when it comes to managing configurations. It’s also a big help for scaling up—your monitoring tools can handle growing volumes of data without constant adjustments.

Key Takeaways from the 2024 DORA Report

Google recently released its 2024 Cloud DORA (DevOps Research and Assessment) report, bringing together a decade’s worth of trends, insights, and best practices on what drives high performance in software delivery across industries of all sizes. This year’s findings take a closer look at how DevOps teams can achieve greater resilience and efficiency by adopting AI, improving team well-being, and building powerful internal platforms. ‍

VictoriaMetrics Efficiently Simplifies Log Complexity with VictoriaLogs

Salt Lake City, Utah, 13th November 2024 – Today we’re delighted to announce the GA release of our innovative logging solution - VictoriaLogs. Our easy-to-use, open source log management solution combines a powerful query language for easy log searching with minimal resource requirements. It’s perfect for managing and analyzing large volumes of log data, especially in containerised environments such as Kubernetes.

An Engineer's Guide to Making Sense of Log Data

In the webinar, the experts explained why a log management strategy is crucial if you want to accurately assess the health and compliance of your applications. Topics include: Cloud native technologies have made it harder to understand how systems are behaving. Logs are the answer, but they can be voluminous and complex in any environment. How do you make sense of them?

The Ultimate Guide to AWS Logging: Tools, Types, and Techniques

AWS logs are fundamental for organizations to conduct performance analysis, troubleshooting, security monitoring, and adhere to compliance requirements. But if you’re using more than one AWS service you can quickly realize that your logs are expanding out of control across decentralized locations. Therefore it’s crucial that you can process and analyze all your AWS logs within a single centralized repository.

Key Availability and Uptime Metrics, Stats, and KPIs You Should Monitor and Report On

What are availability and uptime metrics and why should you measure them? In the past, development teams pushed new features, and operations teams handled issues as they arose. However, as more businesses pivot to a DevOps infrastructure, all IT teams work side by side throughout an application’s lifecycle, from coding and testing to deployment and monitoring.