Rootly

How AI broke serverless and what to do about it with Vercel's Mariano Fernández Cocirio

Mar 6, 2025 By Rootly In Rootly

Mariano, Staff Product Manager at Vercel, explains why serverless architectures are hitting unexpected limits—they’re too fast. The industry has spent millions optimizing serverless for speed, but AI workloads are changing the game. In the AI realm, slower execution often leads to better results. The challenge? Paying for all that idle compute time while waiting for AI responses.

View Video

Rootly

Read more about How AI broke serverless and what to do about it with Vercel's Mariano Fernández Cocirio

I Want My Shoes Fast! Observability, SRE Burnout, and OTel with Dynatrace's Adriana Villela

Feb 27, 2025 By Rootly In Rootly

In this episode, we sit down with Adriana Villela, Principal DevRel at Dynatrace and OpenTelemetry contributor to break down how observability impacts reliability. We dive into what contributes to SRE burnout and how managers can create psychologically safer spaces for responders. Adriana also shares her perspective on AI as an observability-buddy to navigate incidents.

View Video

Rootly

Read more about I Want My Shoes Fast! Observability, SRE Burnout, and OTel with Dynatrace's Adriana Villela

AI in Production with GitHub's Sean Goedecke

Feb 18, 2025 By Rootly In Rootly

In this episode, we sit down with Sean Goedecke, Staff Software Engineer at GitHub, to discuss where LLMs fit into real-world development. Sean shares how he’s using LLMs how he’s drawing the line for AI-assistance in the codebases he manages—though, as he says, this might all change by next summer. Sean also weighs in on how LLMs could assist SREs during outages—especially when you’re only half-awake at 3 a.m. after a rather inconvinient page.

View Video

Rootly

Read more about AI in Production with GitHub's Sean Goedecke

The Domino Effect of Outages with Nuno Tomás, Founder of isDown.app

Jan 24, 2025 By Rootly In Rootly

Humans of Reliability: Keeping systems up and the lights on isn’t just about technology—it’s about the people behind it. In this episode, we’re thrilled to chat with Nuno Tomas, founder of Isdown.app, a vendor outage monitoring tool transforming how teams handle third-party incidents. Nuno shares his journey from software engineer to entrepreneur, the pivotal 4 a.m. moment that inspired Isdown, and the challenges of balancing startup life with family. We dive into the complexities of incident communication, how to tackle alert fatigue, and why transparency is key to building trust in SaaS.

View Video

Rootly

Read more about The Domino Effect of Outages with Nuno Tomás, Founder of isDown.app

How Stress Affects Our Learning Abilities in Incidents (And What To Do About It)

Aug 6, 2024 By Sorrel Harriet In Rootly

While retrospectives provide a valuable pathway for learning outside of the flow of work, we also want learning to happen during an incident or unexpected event as it unfolds. This can be challenging due to the negative impact of stress on our ability to learn and navigate difficult situations. In this article, we’ll dig into how stress inhibits our ability to learn and what we can do about it.

Read Post

Rootly

Read more about How Stress Affects Our Learning Abilities in Incidents (And What To Do About It)

The Best SRE Tools To Improve Reliability and Streamline Operations

Jul 31, 2024 By Iryna Iurchenko In Rootly

For better or worse, most companies—including their execs and developers—see SREs as superheroes who’ll save them from the evils of downtime and service degradation with their boundless superpowers. SREs are expected to constantly perform dangerous stunts like production debugging or communicating highly technical issues to angry VPs. They must also be able to manage infrastructure, networks, databases, pipelines, operating systems and much more.

Read Post

Rootly

Read more about The Best SRE Tools To Improve Reliability and Streamline Operations

Creating Schedules + Escalation Policies with Rootly On Call

Jul 26, 2024 By Rootly In Rootly

Ashley walks you through how to create a schedule and escalation policy using Rootly On-Call, a modern on-call and incident management solution. rootly.com/on-call.

View Video

Rootly

Read more about Creating Schedules + Escalation Policies with Rootly On Call

Rootly On-Call: On-Call Shadowing Feature

Jul 26, 2024 By Rootly In Rootly

Shadowing experienced responders is one of the most effective ways for folks who are new to on-call to gain the confidence and knowledge to handle incidents independently. Traditionally, shadow rotations are cumbersome to set up, involving duplicating and editing an existing schedule. For Rootly On-Call users, setting up shadow rotations couldn’t be easier with our new native Shadowing feature. Here are a few highlights.

View Video

Rootly

Read more about Rootly On-Call: On-Call Shadowing Feature

Beyond MTTR: 7 incident metrics that matter and 3 that don't

Jul 24, 2024 By Ashley Sawatsky In Rootly

Pets.com was an online pet supply retailer founded in 1998, during the dot-com craze. In February 2000, it raised $83 million to go public based mainly on metrics like user acquisition, website traffic, and brand recognition. However, the profit margins were minimal and the marketing costs exorbitant, which led Pets.com to file for bankruptcy nine months after its IPO. The industry now recognizes these metrics as vanity metrics.

Read Post