Building Blocks for Site Reliability
Abstract
How does Google run reliable systems? At the heart of Site Reliability Engineering is the idea of treating reliability as a software problem and asking software engineers to design an operations function. This talk will examine the organizational, conceptual, and technological building blocks that together make up Site Reliability Engineering at Google.