Summit Recap: How to adapt to a "Digital Everything" World

Every interaction with our customers, partners, and employees is special – but this year’s PagerDuty Summit went far beyond my wildest dreams. Together we committed to helping you learn and grow in how you manage business-critical operations – in other words, getting you ready for anything in a world of Digital Everything.

Minimize MTTR to Mitigate Impact of Change Management

In the first blog of this demo series, we showed you how to use Restorepoint to remediate after a network breach. In this second blog of the three-part series, we walk you through a change management instance, showing how to speed problem resolution and mitigate the impact of poor change management to minimize MTTR.

Your First Shipa Canary Deployment(s)

Taking its name from the saying “canary in a coal mine”, the canary deployment/release methodology is an incremental release focused on safety. If the canary does not pass, the deployment does not continue or is rolled back. Taking a jog down memory lane: a few years ago, a canary deployment in Kubernetes was quite the undertaking, much like Kubernetes the Hard Way.
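
The pass-or-roll-back rule described above can be sketched in a few lines of Python. This is a minimal illustration and not Shipa's or Kubernetes' API: the names `run_canary`, `observe_error_rate`, and `max_error_rate`, along with the traffic steps, are all hypothetical and chosen only for this example.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class CanaryResult:
    traffic_percent: int  # share of traffic sent to the new version
    error_rate: float     # error rate observed at that share


def run_canary(
    steps: List[int],
    observe_error_rate: Callable[[int], float],
    max_error_rate: float = 0.01,
) -> Tuple[str, List[CanaryResult]]:
    """Shift traffic step by step; roll back on the first failing step."""
    history: List[CanaryResult] = []
    for percent in steps:
        # Hypothetical hook: in practice this would query a metrics system.
        rate = observe_error_rate(percent)
        history.append(CanaryResult(percent, rate))
        if rate > max_error_rate:
            # The canary did not pass, so the rollout stops here.
            return "rolled_back", history
    # Every step passed, so the new version takes all traffic.
    return "promoted", history
```

Calling `run_canary([10, 50, 100], probe)` with a probe that starts reporting errors at 50% traffic would stop at the second step and never reach 100%.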

A quick guide to load testing Grafana Loki with Grafana k6

As a software engineer here at Grafana Labs, I’ve learned there are two questions that commonly come up when someone begins setting up a new Loki installation: “How many logs can I ingest into my cluster?” followed by, “How fast can I query these logs?” There are two ways to find out the answers.

Monitoring Ruby on Rails with InfluxDB

Time series databases like InfluxDB are databases that specialize in handling time series data, which is data that is indexed by time. Unlike traditional databases, time series databases are optimized for reading and writing data with less performance consideration for updating or deleting data. Due to the time-dependent nature of time series data, time series databases are handy for application monitoring.
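
As a rough illustration of what "indexed by time" and "optimized for reading and writing" can mean, here is a toy append-only series in Python. It is a sketch under stated assumptions, not how InfluxDB is implemented: the class `TimeSeries` and its methods are hypothetical names, and the example assumes points arrive in timestamp order.

```python
import bisect
from typing import List, Tuple


class TimeSeries:
    """Toy append-only store: fast appends and fast time-range reads."""

    def __init__(self) -> None:
        self._times: List[float] = []   # sorted timestamps (the index)
        self._values: List[float] = []  # values aligned with _times

    def append(self, ts: float, value: float) -> None:
        # Assumption for this sketch: writes arrive in time order,
        # so an append never has to shift existing data.
        if self._times and ts < self._times[-1]:
            raise ValueError("out-of-order write not supported in this sketch")
        self._times.append(ts)
        self._values.append(value)

    def range(self, start: float, end: float) -> List[Tuple[float, float]]:
        # Binary search on the sorted timestamps finds the window bounds.
        lo = bisect.bisect_left(self._times, start)
        hi = bisect.bisect_right(self._times, end)
        return list(zip(self._times[lo:hi], self._values[lo:hi]))
```

Note that the sketch offers no update or delete at all, which mirrors the trade-off described above in an exaggerated form.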

Banking on change: The future of customer service in financial services

Fintech companies around the world are transforming how financial services are delivered. Nowhere is that more apparent than in customer service, especially around complaints, disputes, and fraud operations, with new entrants redefining what good looks like. Australian firms such as Afterpay, the buy-now, pay-later giant acquired by Square, have succeeded by offering new products and experiences in line with consumer preferences for simple, intuitive, streamlined services.

What Are Preload Resource Hints?

Preloads are a powerful optimization technique that can make significant improvements to crucial performance metrics such as Core Web Vitals. I have written on prefetching a DNS lookup or even preconnecting to a domain. Preloading is a much more powerful extension of these concepts because it enables you to download entire resources in advance. In this article, let’s look at.

Tracing errors and surfacing collateral damage across your code base

Frontend technologies typically talk to several services in your backend, and those services talk to other services. At the root of every issue is a single event that causes a domino effect, one that impacts every operation from the first experience on the frontend to the backend API call. Sentry can show you how these exceptions and latency issues impact every one of your services. For example, take the ever-common and seemingly simple-to-resolve 500 - Internal Server Error.

Receiving PagerDuty alerts from MetricFire

One of the most critical aspects of monitoring your digital assets is getting a timely alert when something goes wrong. Even if you have finished building a monitoring stack and exposed metrics on a beautifully designed dashboard, your monitoring system does not serve its purpose if you cannot notice abnormal behavior and take pre-emptive or follow-up action swiftly.