Abstract
This talk addresses how to apply SRE principles and best practices in running a consistent and reliable training program for an SRE team. We’ll look at this from both a technical and operations perspective. We’ll share the importance of giving new SREs hands-on experience with production infrastructure early in an environment that is real but safe for them to learn. We’ll share some challenges that we encountered in building an educational stack and associated curriculum that can be induced to break on demand (e.g., SRE managed platforms are resilient and sometimes you can’t easily break them in the ways you want) and approaches to solve for those challenges.
Research Areas
Learn more about how we do research
We maintain a portfolio of research projects, providing individuals and teams the freedom to emphasize specific types of work