Site Reliability Engineering: How Google Runs Production Systems
Rate it:
Open Preview
Read between August 18, 2017 - January 26, 2019
7%
Flag icon
We define toil as mundane, repetitive operational work providing no enduring value, which scales linearly with service growth.
15%
Flag icon
“If we are engineering processes and solutions that are not automatable, we continue having to staff humans to maintain the system. If we have to staff humans to do the work, we are feeding the machines with the blood, sweat, and tears of human beings. Think The Matrix with less special effects and more pissed off System Administrators.”
31%
Flag icon
The cost of failure is education.
Ricardo
Or is it the reward ?
31%
Flag icon
A blamelessly written postmortem assumes that everyone involved in an incident had good intentions and did the right thing with the information they had.
31%
Flag icon
Blameless culture originated in the healthcare and avionics industries where mistakes can be fatal.
31%
Flag icon
Best Practice: No Postmortem Left Unreviewed