Operations | Monitoring | ITSM | DevOps | Cloud

Balancing Reliability at the Crypto-Finance Frontier with Brian Shaw (Uphold)

Sylvain Kalache sits down with Brian Shaw, Senior Engineering Leader at Uphold, to explore the reliability challenges that arise when operating at the intersection of traditional finance and crypto markets. Brian shares how unexpected market events can create massive traffic spikes, how their platform architecture and Kubernetes setup help them stay resilient, and why Uphold's transparency and regulatory approach make them both trustworthy and a high-profile target.

Why Reliability Starts with the Network, even in the AI era, with Marino Wijay

In this episode, we explore how networking has shaped reliability as we know it. Marino Wijay cloud networking expert and Staff Solutions Architect at Kong shares how his journey began not as an SRE, but with cables, routers, and switches. Marino explains the evolution of the fabric holding systems together through virtualization, and how software-defined networking, which is now a key element to resilient applications.

The New Rootly Ringtones: How Research-based On-Call Sounds

We set out to create a ringtone that wasn’t just loud—but the sound of a modern pager. Something that wakes you up, but without triggering a full-blown adrenaline spike. In this video, go behind the scenes with sound engineer Gorjão as he crafts a how research-based on-call sound sounds like.

Metrics That Matter: Measuring Developer Productivity in the AI Era

In this episode, Ryan McDonald is joined by Mark Quigley, Head of Platform Engineering at Ninety.io, for a conversation that cuts through the noise around developer productivity metrics and AI. Mark dives deep into how teams can measure what matters—without falling into the trap of turning every measure into a target. He shares how tools like Developer NPS, DORA metrics, and balanced scorecards can help teams optimize for both output and well-being—but only when framed with the right intent.

Are AI and Platforms Making SRE Obsolete? With Kaspar von Grünberg, Humanitec's CEO

Last year, over 89% of companies claimed to have adopted platform engineering. And, in the past month, LLMs have been disrupting how we think about software development. In this context, Kaspar, asks if the role of Site Reliability Engineers is being obsolete as we know it. Kaspar argues that while SREs aren’t going anywhere, their responsibilities are evolving—fast. We talk about.