FireHydrant

https://firehydrant.io/

Manhattan, NY, USA

2017

FireHydrant is now AI-powered for faster, smarter incidents

Mar 18, 2024 | By Robert Ross

Over the last five years we’ve seen our customers run 583,954 incidents more efficiently thanks to a shared workspace, powerful Runbook automations, and auto-captured data. Yet despite a great deal of progress, incident efficiency hasn’t achieved peak potential. We talk to a lot of folks who are still stuck in the muck: new responders struggle to get up to speed quickly, incident commanders wade through post-incident drudgery, and knowledge silos prevent comprehensive improvements.

Read Post

Inside the gamedays: how we tested Signals for reliability

Mar 11, 2024 | By Danielle Leong

FireHydrant is mission-critical infrastructure for thousands of engineers. It’s our job to be up – even when everything else is down. Here's a technical look at how we tested Signals alerting and on-call to ensure high availability and speed.

Read Post

3 questions to ask of any DevOps tool in 2024

Mar 6, 2024 | By Robert Ross

Is your DevOps tool stack out of control? I feel like every day, I talk to someone who feels this pain. The technological golden age of the past few years created a lot of niche tools, but now that CFOs and boards alike are demanding budget restraint, many of these tools are being scrutinized. The reality of the situation is that it’s not good enough for a tool to do one thing anymore.

Read Post

Finally: alerting and on-call scheduling for how you actually work

Feb 29, 2024 | By Robert Ross

TL;DR You deserve a better alerting and on-call tool. So we built Signals. In our early days, we often used the tagline, “You just got paged. Now what?” It encapsulated how FireHydrant solved for all of the messy bits that come after your alert is fired, from incident declaration all the way through to retrospective. At the time, we saw alerting and on-call scheduling as a solved problem.

Read Post

New MTTX analytics to drive your reliability roadmap

Feb 14, 2024 | By Milan Thakker

Analytics are great. We can all agree there. But not all analytics are created equal. FireHydrant has long offered incident analytics dashboards that provide an in-depth look at the entire incident lifecycle. You can see how incidents impact services and teams, understand retrospective participation and completion, and even get insight into follow-ups. But great analytics do more than simply organize data. They help you tell a story.

Read Post

The revolution in critical incident response at Dock: efficient integration and service improvement

Feb 13, 2024 | By The FireHydrant Team

In this article, we will explore how Dock is working to significantly enhance its response time to critical incidents, emphasizing effective integration between tools as key to success. We will address how we challenge the conventional approach by shifting the focus from Mean Time to Acknowledge (MTTA) to Mean Time to Combat (MTTC), a customized metric that measures the time between incident detection and effective communication involving professionals capable of resolving it.

Read Post

The alert fatigue dilemma: A call for change in how we manage on-call

Jan 18, 2024 | By Robert Ross

Once the unsung heroes of the digital realm, engineers are now caught in a cycle of perpetual interruptions thanks to alerting systems that haven't kept pace with evolving needs. A constant stream of notifications has turned on-call duty into a source of frustration, stress, and poor work-life balance. In 2021, 83% of software engineers surveyed reported feelings of burnout from high workloads, inefficient processes, and unclear goals and targets.

Read Post

Now in beta: alerting for modern DevOps teams

Dec 8, 2023 | By Robert Ross

Although FireHydrant has spent five years focused on what happens after your team (erg, I mean service 🙄) gets paged, the topic of alerting often comes up in discussions with our community. People are tired of paying big bucks for software that’s expensive, bloated, and hasn’t seen much innovation. Clearly, there’s a problem here – and we’re tackling it head on.

Read Post

Captain's Log: Diving into our scheduling design

Dec 5, 2023 | By Robert Ross

On-call scheduling is tricky. Like, really tricky. It was one of the scariest parts when we decided to build a modern alerting system earlier this year. We knew we couldn't cut any corners on Day One of our release because it needed to be a fully loaded feature for someone to realistically use our product (and replace an incumbent). This meant including windowed restrictions, coverage requests, and simple to complex rotations.

Read Post

Your guide to better incident status pages

Nov 30, 2023 | By Jouhné Scott

Your status page (or lack thereof) has the opportunity to signal a lot about your brand — how transparent you are, how quickly you respond to incidents, how you communicate with your customers — and ultimately, this all seriously impacts your reliability. After all, as our CEO Robert put it in a recent interview on the SRE Path podcast, you don’t get to decide your reliability; your customers do.

Read Post

4 Minute Demo of FireHydrant

Feb 29, 2024 | By FireHydrant

Meet the only all-in-one incident management platform that is there with you from the first alert until you learn from the retrospective.

View Video

Better Incidents Winter Bonfire: Inside On-Call

Dec 14, 2023 | By FireHydrant

Engineers are bombarded with pages left and right. There's uncertainty about how to escalate. A constant blur exists between what's urgent and what can wait. This never-ending ping-pong game takes a toll. Burnout creeps in, and your engineering culture has taken a nose dive before you know it.

View Video

Navigating the SRE Landscape | Better Incidents Podcast Ep. 9

Oct 24, 2023 | By FireHydrant

View Video

Alerting, Incident Management and the SDLC | Better Incidents Podcast Ep. 8

Oct 5, 2023 | By FireHydrant

In this episode we chat with veteran cloud architect Masaru Hoshi about the challenges of alert fatigue, the importance of effective alerting systems, and fostering ownership in software teams. Masaru shares insights from his 30-year career, emphasizing the need for balance, trust, and collaboration in incident response.

View Video

Practicing SDLC the right way #shorts #incidentresponse #sre #softwareengineer

Oct 4, 2023 | By FireHydrant

View Video

The problem with noise in Alerting #shorts #incidentresponse #sre #softwareengineer

Oct 4, 2023 | By FireHydrant

View Video

Forget MTTR - focus on assembly time

May 15, 2023 | By FireHydrant

View Video

FireHydrant Slack Incident Management Demo

Jan 22, 2022 | By FireHydrant

In this demo we'll look at how FireHydrant can solve the pains of quickly declaring and managing an incident, all from Slack.

View Video

FireHydrant - Incidents Happen

Sep 26, 2021 | By FireHydrant

See how FireHydrant can help you achieve better reliability, get to resolution, and back to bed quicker.

View Video

FireHydrant Platform Demo - August 2021

Aug 11, 2021 | By FireHydrant

FireHydrant is the only comprehensive reliability platform that allows teams to achieve reliability at scale by creating speed and consistency across the entire incident response lifecycle.

View Video
