Scaling AI Reliability: Real-World Lessons from Mistral AI
How does one of the world's leading AI companies keep its infrastructure reliable while shipping new models constantly? In this webinar, Devon Mizelle, Senior SRE at Mistral AI, shares the real story.
Devon walks through how Mistral built an automated system that generates synthetic checks for every model the moment it goes live—no manual configuration, no forgotten monitors, no inconsistent alerting. Using monitoring as code, his team eliminated the toil of maintaining hundreds of checks across a rapidly evolving model ecosystem.
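The pattern Devon describes can be sketched with the Checkly Terraform provider: iterate over a list of models and stamp out one synthetic check per model, so a new entry in the list automatically gets a monitor. The model names, endpoint URL, and check settings below are illustrative assumptions, not Mistral's actual configuration.

```hcl
# Hypothetical sketch of "checks as code": one synthetic API check
# per model, generated from a single variable. All names and the
# endpoint URL are placeholder assumptions for illustration.
variable "models" {
  type    = set(string)
  default = ["model-a", "model-b"] # adding a model here adds a check
}

resource "checkly_check" "model_health" {
  for_each  = var.models
  name      = "inference: ${each.key}"
  type      = "API"
  activated = true
  frequency = 5 # run every 5 minutes
  locations = ["eu-west-1"]

  request {
    url    = "https://api.example.com/v1/models/${each.key}"
    method = "GET"

    # Alert if the endpoint stops returning 200
    assertion {
      source     = "STATUS_CODE"
      comparison = "EQUALS"
      target     = "200"
    }
  }
}
```

Because the checks are derived from data rather than written by hand, there is no per-model setup step to forget, which is the "no manual configuration, no forgotten monitors" property described above.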
But this isn't just a technical deep dive. The conversation explores where observability is headed in the AI era: What happens when agents get paged before humans? How close are we to self-healing systems? And what does this mean for the future of SRE?
Featuring:
- Devon Mizelle, Senior Site Reliability Engineer at Mistral AI
- Sylvain Kalache, Head of AI Lab at Rootly
- Giovanni Rago, Head of Customer Solutions at Checkly
⏱️ Timestamps:
0:00 Introduction
2:58 Meet the speakers
5:39 How Mistral uses monitoring and incident management tools
8:39 The problem: what wasn't working before
12:54 The solution: infrastructure as code for monitoring
18:10 Walking through the Terraform implementation
23:30 Configuring alert routing automatically
28:14 Results: happier developers, fewer tools, less toil
33:10 The future of monitoring at scale with AI
36:09 Self-healing checks and automated remediation
43:02 AI SRE: when agents get paged first
50:11 Will AI replace incident management for SREs?
55:16 Q&A: Alert grouping, webhooks, and testing LLM outputs
1:05:37 Wrap-up