AI is changing our reliability response teams
In this clip from an AI roundtable with Gremlin, Nobl9, and PagerDuty, Mandi Walls talks about how AI is bringing AI engineers into incident response teams. Find out how to improve AI reliability with Gremlin → https://www.gremlin.com/solutions/improve-ai-reliability
FULL TRANSCRIPT:
Just like you had a split between SRE and DBRE because there are certain things that you really need to handle in the data and the database layer, the actual ability to change the dials or reevaluate the models or declare that the model needs to be refreshed or whatever the case might be, needs to fall to a different set of engineers than your regular SRE team. This is probably data engineers who are not used to being on call. They're not used to being in the front lines of stuff. They're in the back office with their statistical models and their Python, and they're not usually out in front talking to customers. And their stuff hasn't in the past really faced the customers until now.
It's as much an organizational flex on that side as it is anything else is. We're gonna need to call in the folks who know what's going on. That's been our model from the beginning. You page the people who know the thing, and if that's the web application developers or if it's the machine learning developers, we're seeing a surge, a change in that so bring those folks more into the response team.