Showing a photo of his contribution to @alicegoldfuss oncall photo collection, where he looked happy, but didn't know he wasn't yet.
#SREcon
This is where observability comes in - it's a property of your system.
#SREcon
in our new distributed system world, we need to add this type of visibility explicitly.
#SREcon
Our brains + observability output are what replace the cause based alerts.
#SREcon
#SREcon
Rather than having it page, put it on a dashboard. When someone does get based (by burn rate), they can look for it.
#SREcon
#SREcon
Set a goal for how may pages you want to receive; work towards it.
#SREcon
1) symptom based alerts are good
2) SLO is defined by you, customers, system.
3) SLO implies error budget; informs tolerance.
4) Page only on SLO risk, because that's what matters.
#SREcon
#SREcon