Scientific Incident Management with Dan Slimmon
Dan Slimmon is an incident management veteran who's worked at Etsy, HashiCorp, and now leads consulting and training on pragmatic, non-bureaucratic incident response.
In this episode, Dan shares his philosophy on "scientific incident response," the importance of hypothesis-driven troubleshooting, and why incidents should be seen as normal in complex systems.
We also explore:
- Why asking the right questions is more important than knowing all the answers.
- How to use nerd sniping to unlock insights from engineers.
- Common failure patterns he sees across organizations.