Continuing the #QConLondon production track with @yurynino, and using visual metaphors to understand our production data in a different way.
"In our field, observability is about humans and about how humans interact with technology" @yurynino#QConLondon
Collecting metrics and signals are only one part of the solution - observability has to come with good visualisation, and engineering a solution for humans. @yurynino#QConLondon
Chaos engineering isn't just about injecting the chaos, it's about increasing knowledge about our system (through observability) and overcome its weaknesses. @yurynino#QConLondon
Introducing visual metaphors: displaying data and metrics in real world shapes and analogies. @yurynino#QConLondon
When surveyed, only 55% of engineers answered correctly when looking at a standard line graph - but they all got it right when viewing it in a city metaphor @yurynino#QConLondon
Visual metaphors as alternative solutions to hard to read bar charts. For traffic, using nodes and edges of variable size - and for latency using color coded gauges. @yurynino#QConLondon
"If we have proper visualisation and better metaphors, we set much better conditions for our operators to be comfortable in understanding and responding to variations in our systems." @yurynino#QConLondon
• • •
Missing some Tweet in this thread? You can try to
force a refresh
Here we go, last talk of the day on the production track at #QConLondon, with @rdelvira and "an entertaining outage story" (his own words) when slack rolled out DNSSEC
"Who here tried to rollout DNSSEC?, Ok one person... Now how failed when trying to rollout DNSSEC? Welcome to the club!" 😂 @rdelvira#QConLondon
"We planned DNSSEC carefully, with the necessary changes and replicated most of our DNS use cases... And you'll see later why I said 'most'..." @rdelvira#QConLondon
"Observability is the capability to continuously generate and discover actionable insights based on signals from the system under observation with the goal to influence that system" and that's for both people (eg debugging) and automation (eg autoscaling) @mhausenblas#QConLondon
Observability can go beyond usual metrics, logs and traces: @mhausenblas introducing profiles and eBPF #QConLondon
First talk of the day on the #QConLondon production track, by @glenathan and a challenge: can we build observable services without logs?
"We needed to build a new service in Go, without our usual existing scaffolding in Clojure... That led to some bikeshedding but also gave a chance for experimentation!" @glenathan#QConLondon
"Before this, we spent a lot of money to know what our applications were doing in production"