Site Reliability Engineering: How Google Runs Production Systems
Rate it:
Open Preview
3%
Flag icon
SRE is what happens when you ask a software engineer to design an operations team.
4%
Flag icon
Monitoring should never require a human to interpret any part of the alerting domain. Instead, software should do the interpreting, and humans should be notified only when they need to take action.