Przemek

51%
Flag icon
Processes crash or may need to be restarted. Hard drives fail. Natural disasters can take out several datacenters in a region. Site Reliability Engineers need to anticipate these sorts of failures and develop strategies to keep systems running in spite of them.
Site Reliability Engineering: How Google Runs Production Systems
Rate this book
Clear rating
Open Preview