Operations | Monitoring | ITSM | DevOps | Cloud

How Grafana Labs unlocks the power of recruitment data with Grafana dashboards

As the recruitment team here at Grafana Labs, we used to struggle to get a comprehensive view of our recruitment data. We had multiple sources of information, but it was difficult to pool that information so we could see the big picture and identify trends and patterns that could help us hire the right talent in a highly competitive market.

Monitoring AWS DynamoDB performance and latency

Amazon DynamoDB is a fully managed NoSQL database service provided by AWS and is tailor-made for serverless applications. As a fully managed service, we don’t have to worry about operational tasks with DynamoDB, such as hardware provisioning, configuring instances, scaling, replications, software patching, etc.

SRE Trends from AWS re:Invent 2022

In November/December 2022 I attended AWS re:Invent in Las Vegas. It was certainly an experience for this small town kid from New Zealand, and one that I took a lot away from. While I was at the conference, I took the time to walk around and take notes. In this article I will share the trends that I observed which I think will have an impact on SRE work in 2023 and beyond, including: ...and others.

A Complete Guide to Google's Core Web Vitals and How to Optimize Them

The success of your website lies in how satisfied your users are with it. To help ensure the quality of your user experience, Google uses various signals from a web page. The three Core Web Vitals are some of the most important ones. In this article, I’ll talk about what each Core Web Vital means and how to optimize them to deliver a better user experience.

Optimize Application Performance with Code Profiling

When monitoring your application performance or troubleshooting an issue in production, context is key. The more information available, the faster the prevention of or detection of a user impacting issue. Observability tools offer many different features, like code profiling, to help contextualize your data. In this post, I’ll discuss what code profiling is and show an example of how it works.

Routing Strategies for Security and Observability Data: How to Make the Most of Your Data at Scale

Data routing is a crucial but complex task for companies of all sizes. Ensuring that the right data is sent to the right tools can be a time-consuming and difficult process, and when things go wrong, it can have costly consequences. This is why having a robust data routing strategy is essential for any organization.

An Introduction to AWS Monitoring with Prometheus and Logz.io

Prometheus is a widely utilized time-series database for monitoring the health and performance of AWS infrastructure. With its ecosystem of data collection, storage, alerting, and analysis capabilities, among others, the open source tool set offers a complete package of monitoring solutions. Prometheus is ideal for scraping metrics from cloud-native services, storing the data for analysis, and monitoring the data with alerts.

Business Continuity vs. Business Resilience: Comparing Strategies for Staying Resilient

If there is one thing organizations can take away from the past few years, it's that they are far more vulnerable than they could realize before. From pandemics to critical supply shortages to widespread data breaches and natural disasters, businesses that don’t have plans in place to handle and respond to emergencies are at tremendous risk. As leaders plan for inevitable crises and disruption, interest in business resilience and continuity grows.