Deploying SRE Training Best Practices to Production: How to "SRE" an SRE Training Program

Abstract

This talk addresses how to apply SRE principles and best practices to running a consistent and reliable training program for an SRE team. We’ll look at this from both a technical and an operational perspective. We’ll explain why it is important to give new SREs early hands-on experience with production infrastructure, in an environment that is real but safe to learn in. We’ll also share some of the challenges we encountered in building an educational stack and associated curriculum that can be induced to break on demand (e.g., SRE-managed platforms are resilient, and sometimes you *can’t* easily break them in the ways you want), along with the approaches we used to address those challenges.

Research Areas